CLEAN VERSION OF ALL CLAIMS

Cancel claims 1-27, all the claims in this case, and 
substitute new claims 28-49 as follows: 

28. An enzyme, designated as phospholipid:diacylglycerol
acyltransferase (PDAT), catalyzing in an acyl-CoA-independent
reaction the transfer of fatty acids from phospholipids to
diacylglycerol in the biosynthetic pathway for the production of
triacylglycerol and comprising an amino acid sequence as set forth
in SEQ ID NO: 2 or a functional fragment, derivate, allele, homolog
or isoenzyme thereof.

29. An enzyme according to claim 28, comprising an amino acid 
sequence encoded through a nucleotide sequence as set forth in SEQ 
ID NO: 1 or a homologous nucleotide sequence which is at least about
40% identical to a nucleotide sequence of SEQ ID NO. 1. 

30. An enzyme according to claim 28, comprising an amino acid 
sequence as set forth in SEQ ID NO. 16, 20 or 22, or a functional 
fragment, derivate, allele, homolog or isoenzyme thereof. 

31. An enzyme according to claim 28, comprising an amino acid 
sequence selected from the group consisting of sequences as set 
forth in SEQ ID NO. 6, 8, 13, 14, 15, 17, 18, 25 and 27, a 
functional fragment, derivate, allele, homolog or isoenzyme 
thereof.

32. An enzyme according to claim 28, comprising an amino acid 
sequence encoded through a nucleotide sequence, a portion, 
derivate, allele or homolog thereof selected from the group 
consisting of sequences as set forth in SEQ ID NO. 1, 3, 4, 5, 7, 
9, 10, 11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, or a 
functional fragment, derivate, allele, homolog or isoenzyme of the 
enzyme encoding amino acid sequence. 

33. A nucleotide sequence according to claim 32, selected 
from the group consisting of sequences as set forth in SEQ ID NO.
1, 3, 4, 10, 11, 19, 21, 23, 24, 29 and 30, or a portion, derivate, 
allele or homolog thereof. 

34. A partial nucleotide sequence corresponding to a full 
length nucleotide sequence according to claim 32, selected from the 
group consisting of sequences as set forth in SEQ ID NO. 5, 7, 9, 
12, 25, 26, 28 and 31, or a portion, derivate, allele or homolog 
thereof.

35. A nucleotide sequence according to claim 32, comprising 
a nucleotide sequence which is at least 40% identical to a 
nucleotide sequence selected from the group consisting of those 
sequences set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11, 12, 
19, 21, 23, 24, 25, 26, 28, 29, 30 and 31. 

36. A gene construct comprising a nucleotide sequence as set
forth in SEQ ID No. 1, or a homologous nucleotide sequence which is
at least about 40% identical to the nucleotide sequence of SEQ ID
No. 1, which is operably linked to a heterologous nucleic acid.

37. A vector comprising a gene construct according to claim 
36, or the nucleotide sequence as set forth in SEQ ID No. 1, or a 
homologous nucleotide sequence which is at least about 40% 
identical to the nucleotide sequence of SEQ ID No. 1. 

38. A vector according to claim 37, which is an expression 
vector. 

39. A vector according to claim 37, further comprising a 
selectable marker gene and/or nucleotide sequences for the 
replication in a host cell or the integration into the genome of 
the host cell. 

40. A transgenic cell or organism comprising one or more of 
the following: 

a) a nucleotide sequence a1) to a4),

b) a gene construct b1), and

c) a vector c1),
wherein 

a1) is a nucleotide sequence as set forth in SEQ ID NO. 1 or a
homologous nucleotide sequence which is at least about 40% 
identical to a nucleotide sequence of SEQ ID NO. 1, 

a2) is a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of 
sequences as set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11, 
12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, or a functional 
fragment, derivate, allele, homolog or isoenzyme of the enzyme 
encoding amino acid sequence, 

a3) is a partial nucleotide sequence which corresponds to a full
length nucleotide sequence selected from the group consisting 
of sequences as set forth in SEQ ID NO. 5, 7, 9, 12, 25, 26, 
28 or 31, or a portion, derivate, allele or homolog thereof; 

a4) is a nucleotide sequence which is at least 40% identical to a
nucleotide sequence selected from the group consisting of 
those sequences set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 
11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, 

b1) is a gene construct comprising a nucleotide sequence a),
operably linked to a heterologous nucleic acid, and 

c1) is a vector comprising a gene construct b1), or a nucleotide
sequence a1).

41. A transgenic cell or organism according to claim 40, 
which is an eucaryotic cell or organism. 

42. A transgenic cell or organism according to claim 40, 
which is a yeast cell or a plant cell or a plant. 

43. A transgenic cell or organism according to claim 40 
having an altered biosynthetic pathway for the production of 
triacylglycerol, characterized by the prevention of accumulation of
undesirable fatty acids, which are harmful if present in high 
amounts in membrane lipids. 

44. A transgenic cell or organism according to claim 40 
having an altered, increased oil content. 

45. A transgenic cell or organism according to claim 40, 
wherein the activity of PDAT is altered, characterized by an 
alteration in gene expression, catalytic activity and/or regulation 
of activity of the enzyme. 

46. A process for the production of triacylglycerol, 
comprising growing a transgenic cell or organism according to claim 
40 under conditions whereby the said nucleotide sequence 

a1) is a nucleotide sequence as set forth in SEQ ID NO. 1 or a
homologous nucleotide sequence which is at least about 40% 
identical to a nucleotide sequence of SEQ ID NO. 1, 

a2) is a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of 
sequences as set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11, 
12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, or a functional 
fragment, derivate, allele, homolog or isoenzyme of the enzyme 
encoding amino acid sequence, 

a3) is a partial nucleotide sequence which corresponds to a full
length nucleotide sequence selected from the group consisting 
of sequences as set forth in SEQ ID NO. 5, 7, 9, 12, 25, 26, 
28 or 31, or a portion, derivate, allele or homolog thereof; 

a4) is a nucleotide sequence which is at least 40% identical to a
nucleotide sequence selected from the group consisting of 
those sequences set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 
11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, 

is expressed.

47. Triacylglycerols produced by a process according to claim 46.

48. Use of a nucleotide sequence selected from the group of

a1) is a nucleotide sequence as set forth in SEQ ID NO. 1 or a
homologous nucleotide sequence which is at least about 40%
identical to a nucleotide sequence of SEQ ID NO. 1,

a2) is a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of
sequences as set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11,
12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, or a functional 
fragment, derivate, allele, homolog or isoenzyme of the enzyme 
encoding amino acid sequence, 

a3) is a partial nucleotide sequence which corresponds to a full
length nucleotide sequence selected from the group consisting 
of sequences as set forth in SEQ ID NO. 5, 7, 9, 12, 25, 26, 
28 or 31, or a portion, derivate, allele or homolog thereof; 

a4) is a nucleotide sequence which is at least 40% identical to a
nucleotide sequence selected from the group consisting of 
those sequences set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 
11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, 

or an enzyme selected from the group of 

d1) is an enzyme, designated as phospholipid:diacylglycerol
acyltransferase (PDAT), catalyzing in an acyl-CoA-independent
reaction the transfer of fatty acids from phospholipids to 
diacylglycerol in the biosynthetic pathway for the production 
of triacylglycerol and comprising an amino acid sequence as 
set forth in SEQ ID NO. 2 or a functional fragment, derivate, 
allele, homolog or isoenzyme thereof, 

d2) is an enzyme d1), comprising an amino acid sequence as set
forth in SEQ ID NO. 16, 20 or 22, or a functional fragment,
derivate, allele, homolog or isoenzyme thereof,

d3) is an enzyme d1), comprising an amino acid sequence selected
from the group consisting of sequences as set forth in SEQ ID
NO. 6, 8, 13, 14, 15, 17, 18, 25 and 27, or a functional
fragment, derivate, allele, homolog or isoenzyme thereof,
for the production of triacylglycerol and/or triacylglycerols with 
uncommon fatty acids, comprising medium chain fatty acids, 
hydroxylated fatty acids, epoxygenated fatty acids and acetylenic 
fatty acids. 

49. Use of a nucleotide sequence selected from the group of 

a1) is a nucleotide sequence as set forth in SEQ ID NO. 1 or a
homologous nucleotide sequence which is at least about 40% 
identical to a nucleotide sequence of SEQ ID NO. 1, 

a2) is a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of 
sequences as set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11, 
12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, or a functional 
fragment, derivate, allele, homolog or isoenzyme of the enzyme 
encoding amino acid sequence, 

a3) is a partial nucleotide sequence which corresponds to a full
length nucleotide sequence selected from the group consisting 
of sequences as set forth in SEQ ID NO. 5, 7, 9, 12, 25, 26, 
28 or 31, or a portion, derivate, allele or homolog thereof; 

a4) is a nucleotide sequence which is at least 40% identical to a
nucleotide sequence selected from the group consisting of 
those sequences set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 
11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 and 31, 

or an enzyme selected from the group of 

d1) is an enzyme, designated as phospholipid:diacylglycerol
acyltransferase (PDAT), catalyzing in an acyl-CoA-independent
reaction the transfer of fatty acids from phospholipids to
diacylglycerol in the biosynthetic pathway for the production 
of triacylglycerol and comprising an amino acid sequence as 
set forth in SEQ ID NO. 2 or a functional fragment, derivate, 
allele, homolog or isoenzyme thereof, 
d2) is an enzyme d1), comprising an amino acid sequence as set
forth in SEQ ID NO. 16, 20 or 22, or a functional fragment, 
derivate, allele, homolog or isoenzyme thereof, 
d3) is an enzyme d1), comprising an amino acid sequence selected
from the group consisting of sequences as set forth in SEQ ID 
NO. 6, 8, 13, 14, 15, 17, 18, 25 and 27, or a functional 
fragment, derivate, allele, homolog or isoenzyme thereof, 
for the transformation of any cell or organism in order to be 
expressed in this cell or organism and result in an altered, 
preferably increased oil content of this cell or organism.

REMARKS

The claims have been amended to eliminate multiple dependency 
and to place them in better form for U.S. practice. Further, 
amendments made in the international stage, albeit not under 
Article 19, are also incorporated. The changes made in the claims 
were as follows: 

i. the subject-matter of claim 2 and 3 was included in claim 1 
(now claim 28) ; 

ii. claim 29 was introduced according to the disclosed homologous 
nucleotide sequence on page 6 of the specification; 

iii. claims 7, 8 and 25 were deleted; 

iv. claims 30-42 essentially correspond to claims 4-6 and 9-18 of 
the international application; 

v. claim 23 was incorporated into claim 19, which is now claim 
43; 

vi. claim 20, now 44, was amended by defining the altered oil
content according to page 9, line 9;

vii. claim 22 was incorporated into claim 21, which is now claim
45;

viii. claim 46 essentially corresponds to claim 24 of the
international application;

ix. in claim 26, now 47, triacylglycerols with uncommon fatty
acids were defined according to page 10, line 29;

x. claim 48 essentially corresponds to claim 27 of the
international application;

xi. in the other claims, editorial amendments were made. 
Favorable action on the application is solicited.



Respectfully submitted, 



KEIL & WEINKAUF 




Herbert B. Keil 
Reg. No. 18,967 



1101 Connecticut Avenue, N.W. 
Washington, D.C. 20036 
(202) 659-0100 
HBK/kas 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

BOX PCT

In re Application of

DAHLQUIST et al.

Serial No. 09/937,779 

Filed: September 28, 2001 

For: NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE 
PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA
MOLECULES ENCODING THESE ENZYMES 

I hereby certify that this correspondence is being deposited with the 
United States Postal Service as first class mail in an envelope 
addressed to Commissioner of Patents and Trademarks, 
Washington, D.C. 20231, on: 

June 5, 2002 



Date of Deposit

Karen Stamper

Person Making Deposit

June [illegible], 2002

Date of Signature 



Honorable Commissioner of
Patents and Trademarks
Washington, D.C. 20231 



PRELIMINARY AMENDMENT 
AND 

RESPONSE TO NOTIFICATION OF MISSING REQUIREMENTS UNDER 35 USC 371 



Sir: 

In response to the Notification of Missing Requirements under 35 USC 371, a
copy of the Sequence Listing in computer readable form is attached hereto. The
content of the paper copy of the Sequence Listing and the copy of the Sequence Listing
in computer readable form is the same, and includes no new matter.

IN THE SPECIFICATION

Delete the sequence listing in the specification on pages 1/58 to 58/58 and 
substitute with the attached replacement sequence listing on separate pages 1-52. 



REMARKS 

It is believed that by submitting the present amendment and sequence listing
diskette, the application now fully complies with the requirements of 37 CFR
1.821-1.825. Favorable action by the examiner is solicited.

Please charge any shortage in fees due in connection with the filing of this
paper, including Extension of Time fees, to Deposit Account No. 11-0345. Please credit
any excess fees to such account.

Respectfully submitted, 
KEIL & WEINKAUF 



BASF-NAE 3377/99 PCT    12.09.2000

A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE
PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA
MOLECULES ENCODING THESE ENZYMES

The present invention relates to the isolation, identification and characterization
of recombinant DNA molecules encoding enzymes catalysing the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for
the production of triacylglycerol.



Triacylglycerol (TAG) is the most common lipid-based energy reserve in nature.
The main pathway for synthesis of TAG is believed to involve three sequential
acyl-transfers from acyl-CoA to a glycerol backbone (1, 2). For many years,
acyl-CoA:diacylglycerol acyltransferase (DAGAT), which catalyzes the third
acyl transfer reaction, was thought to be the only unique enzyme involved in
TAG synthesis. It acts by diverting diacylglycerol (DAG) from membrane lipid
synthesis into TAG (2). Genes encoding this enzyme were recently identified
both in the mouse (3) and in plants (4, 5), and the encoded proteins were
shown to be homologous to acyl-CoA:cholesterol acyltransferase (ACAT). It
was also recently reported that another DAGAT exists in the oleaginous fungus
Mortierella ramanniana, which is unrelated to the mouse DAGAT, the ACAT
gene family or to any other known gene (6).

The instant invention relates to a novel type of enzymes and their encoding
genes for transformation. More specifically, the invention relates to the use of a
type of genes encoding a not previously described type of enzymes, hereinafter
designated phospholipid:diacylglycerol acyltransferases (PDAT), whereby this
enzyme catalyses an acyl-CoA-independent reaction. The said type of genes,
expressed alone in transgenic organisms, will enhance the total amount of oil
(triacylglycerols) produced in the cells. The PDAT genes, in combination with a
gene for the synthesis of an uncommon fatty acid, will, when expressed in
transgenic organisms, enhance the levels of the uncommon fatty acids in the
triacylglycerols.

There is considerable interest world-wide in producing chemical feedstock,
such as fatty acids, for industrial use from renewable plant resources rather
than non-renewable petrochemicals. This concept has broad appeal to
manufacturers and consumers on the basis of resource conservation and
provides significant opportunity to develop new industrial crops for agriculture.

There is a diverse array of unusual fatty acids in oils from wild plant species
and these have been well characterised. Many of these acids have industrial
potential and this has led to interest in domesticating relevant plant species to
enable agricultural production of particular fatty acids.

Development in genetic engineering technologies combined with greater
understanding of the biosynthesis of unusual fatty acids now makes it possible
to transfer genes coding for key enzymes involved in the synthesis of a
particular fatty acid from a wild species into domesticated oilseed crops. In this
way individual fatty acids can be produced in high purity and quantities at
moderate costs.

In all crops like rape, sunflower, oil palm etc., the oil (i.e. triacylglycerols) is the
most valuable product of the seeds or fruits, and other compounds like starch,
protein and fibre are regarded as by-products of less value. Enhancing the
quantity of oil on a per-weight basis at the expense of other compounds in oil crops
would therefore increase the value of the crop. If genes regulating the allocation of
reduced carbon into the production of oil can be up-regulated, the cells will
accumulate more oil at the expense of other products. Such genes might not
only be used in already high oil producing cells, such as oil crops, but could
also induce significant oil production in moderate or low oil containing crops
such as soy, oat, maize, potato, sugar beets and turnips, as well as in
micro-organisms.

Summary of the invention

Many of the unusual fatty acids of interest, e.g. medium chain fatty acids,
hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids, have physical
properties that are distinctly different from the common plant fatty acids. The
present inventors have found that, in plant species naturally accumulating
these uncommon fatty acids in their seed oil (i.e. triacylglycerol), these acids
are absent, or present in very low amounts, in the membrane (phospho)lipids of
the seed. The low concentration of these acids in the membrane lipids is most
likely a prerequisite for proper membrane function and thereby for proper cell
functions. One aspect of the invention is that seeds of transgenic crops can be
made to accumulate high amounts of uncommon fatty acids if these fatty acids
are efficiently removed from the membrane lipids and channelled into seed
triacylglycerols.

The inventors have identified a novel class of enzymes in plants catalysing the
transfer of fatty acids from phospholipids to diacylglycerol in the production of
triacylglycerol through an acyl-CoA-independent reaction, and have found that these
enzymes (phospholipid:diacylglycerol acyltransferases, abbreviated as PDAT)
are involved in the removal of hydroxylated and epoxygenated fatty acids, and
probably also other uncommon fatty acids such as medium chain fatty acids,
from phospholipids in plants.

This enzyme reaction was shown to be present in microsomal preparations
from baker's yeast (Saccharomyces cerevisiae). The instant invention further
pertains to an enzyme comprising an amino acid sequence as set forth in SEQ
ID NO. 2 or a functional fragment, derivate, allele, homolog or isoenzyme
thereof. A so-called 'knock out' yeast mutant, disrupted in the respective gene,
was obtained, and microsomal membranes from the mutant were shown to
totally lack PDAT activity. Thus, it was proved that the disrupted gene encodes
a PDAT enzyme (SEQ ID NO. 1 and 2). Furthermore, this PDAT enzyme is
characterized through the amino acid sequence as set forth in SEQ ID NO. 2
containing a lipase motif of the conserved sequence string FXKWVEA.
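
As an illustration only, a search for the lipase motif named above can be sketched in Python. The sketch assumes that the "X" in FXKWVEA stands for any single amino acid residue in one-letter code; the function name and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
import re

# Assumption: "X" in FXKWVEA means any one residue, written here as [A-Z].
LIPASE_MOTIF = re.compile(r"F[A-Z]KWVEA")


def find_lipase_motif(protein: str) -> list[int]:
    """Return the 0-based start positions of motif matches in a one-letter protein sequence."""
    return [match.start() for match in LIPASE_MOTIF.finditer(protein.upper())]


# Hypothetical toy sequence, used only to exercise the function.
toy_sequence = "MGSLFHKWVEAQQ"
print(find_lipase_motif(toy_sequence))
```

A sequence lacking the leading phenylalanine, or with a substitution in the KWVEA part, returns an empty list under this reading of the motif.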

The instant invention pertains further to an enzyme comprising an amino acid
sequence as set forth in SEQ ID NO. 16, 20 or 22 or a functional fragment,
derivate, allele, homolog or isoenzyme thereof.

Further genes and/or proteins of so far unknown function were identified and
are contemplated within the scope of the instant invention. A gene from
Schizosaccharomyces pombe, SPBC776.14 (SEQ ID NO. 3), and a putative open
reading frame CAA22887 of SPBC776.14 (SEQ ID NO. 13) were identified.
Further, Arabidopsis thaliana genomic sequences (SEQ ID NO. 4, 10 and 11)
coding for putative proteins were identified, as well as a putative open reading
frame AAC80628 from the A. thaliana locus AC 004557 (SEQ ID NO. 14) and a
putative open reading frame AAD10668 from the A. thaliana locus AC 003027
(SEQ ID NO. 15).
Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO.
9) and a Zea mays EST (Expressed Sequence Tag) clone (SEQ ID NO. 7) and
corresponding putative amino acid sequence (SEQ ID NO. 8) were identified.
Finally, two cDNA clones were identified, one Arabidopsis thaliana EST (SEQ
ID NO. 5 and corresponding predicted amino acid sequence SEQ ID NO. 6)
and a Lycopersicon esculentum EST clone (SEQ ID NO. 12).
Further, enzymes designated as PDAT comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID NO. 6,
17, 18, 25 or 27 containing a lipase motif FXKWVEA are contemplated within
the scope of the invention. Moreover, an enzyme comprising an amino acid
sequence encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set forth
in SEQ ID No. 1, 3, 4, 5, 7, 9, 10, 11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 or
31 or a functional fragment, derivate, allele, homolog or isoenzyme of the
enzyme encoding amino acid sequence is included within the scope of the
invention.

A functional fragment of the instant enzyme is understood to be any
polypeptide sequence which shows specific enzyme activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the functional
fragment can for example vary in a range from about 660 ± 10 amino acids to
660 ± 250 amino acids, preferably from about 660 ± 50 to 660 ± 100 amino
acids, whereby the "basic number" of 660 amino acids corresponds in this case
to the polypeptide chain of the PDAT enzyme of SEQ ID NO. 2 encoded by a
nucleotide sequence according to SEQ ID NO. 1. Consequently, the "basic
number" of a functional full-length enzyme can vary in correspondence to the
encoding nucleotide sequence.
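
The length ranges stated above can be made concrete with a small calculation. The following Python sketch assumes that "660 ± n" denotes a window of n residues on either side of the basic number; that reading, and the helper name, are assumptions made for illustration.

```python
BASIC_NUMBER = 660  # residues in the PDAT polypeptide of SEQ ID NO. 2, as stated in the text


def length_window(basic: int, tolerance: int) -> tuple[int, int]:
    """Return the (shortest, longest) length covered by basic ± tolerance."""
    if tolerance < 0 or tolerance > basic:
        raise ValueError("tolerance must lie between 0 and the basic number")
    return basic - tolerance, basic + tolerance


# Tolerances named in the text: broadest range 10 to 250, preferred range 50 to 100.
for tolerance in (10, 50, 100, 250):
    shortest, longest = length_window(BASIC_NUMBER, tolerance)
    print(f"660 ± {tolerance}: {shortest} to {longest} amino acids")
```

Under this reading the widest window, 660 ± 250, spans 410 to 910 amino acids.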

A portion of the instant nucleotide sequence is meant to be any nucleotide
sequence encoding a polypeptide which shows specific activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the
nucleotide portion can vary in a wide range of about several hundreds of
nucleotides based upon the coding region of the gene or a highly conserved
sequence. For example the length varies in a range from about 1900 ± 10 to
1900 ± 1000 nucleotides, preferably from about 1900 ± 50 to 1900 ± 700 and
more preferably from about 1900 ± 100 to 1900 ± 500 nucleotides, whereby the
"basic number" of 1900 nucleotides corresponds in this case to the encoding
nucleotide sequence of the PDAT enzyme of SEQ ID NO. 1. Consequently, the
"basic number" of a functional full-length gene can vary.

An allelic variant of the instant nucleotide sequence is understood to be any
different nucleotide sequence which encodes a polypeptide with an
equivalent function. The alleles include naturally occurring variants of the instant
nucleotide sequences as well as synthetic nucleotide sequences produced by
methods known in the art. Contemplated are even altered nucleotide
sequences which result in an enzyme with altered activity and/or regulation or
which is resistant against specific inhibitors. The instant invention further
includes natural or synthetic mutations of the originally isolated nucleotide
sequences. These mutations can be substitution, addition, deletion, inversion
or insertion of one or more nucleotides.

A homologous nucleotide sequence is understood to be a complementary
sequence and/or a sequence which specifically hybridizes with the instant
nucleotide sequence. Hybridizing sequences include similar sequences
selected from the group of DNA or RNA which specifically interact with the instant
nucleotide sequences under at least moderate stringency conditions which are
known in the art. A preferred, non-limiting example of stringent hybridization
conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at
about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at
50-65°C. This further includes short nucleotide sequences of e.g. 10 to 30
nucleotides, preferably 12 to 15 nucleotides. Also included are primers or
hybridization probes.
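
The notion of a complementary sequence and the short sequence lengths mentioned above can be illustrated as follows. This Python sketch only derives the reverse complement of a DNA string and checks a length window; it does not model hybridization stringency, which is an experimental matter. The function names and the example oligonucleotide are hypothetical.

```python
# Watson-Crick pairing for DNA; any other symbol (for example N) passes through unchanged.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def within_probe_length(seq: str, shortest: int = 10, longest: int = 30) -> bool:
    """Check the 10 to 30 nucleotide window given in the text for short sequences."""
    return shortest <= len(seq) <= longest


probe = "ATGGCTAGCTTG"  # hypothetical 12-mer, not from the sequence listing
print(reverse_complement(probe))
print(within_probe_length(probe))
```

Passing `shortest=12, longest=15` checks the preferred window instead of the broader one.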

A homologous nucleotide sequence included within the scope of the instant
invention is a sequence which is at least about 40%, preferably at least about
50% or 60%, more preferably at least about 70%, 80% or 90%, and most
preferably at least about 95%, 96%, 97%, 98% or 99% or more homologous to
a nucleotide sequence of SEQ ID NO. 1.
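
How a percentage threshold of this kind might be checked can be sketched in Python. The text does not specify an alignment method or scoring scheme, so the sketch assumes two sequences that have already been aligned to equal length (gaps written as "-") and simply counts identical positions. The function names and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned and of equal length; a column containing
    a gap ("-") is counted as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, minimum: float = 40.0) -> bool:
    """True if the identity reaches the minimum; 40% is the lowest figure named in the text."""
    return percent_identity(aligned_a, aligned_b) >= minimum


# Hypothetical toy alignment, not derived from SEQ ID NO. 1.
print(percent_identity("ATGCATGCAT", "ATGCTTGGAT"))
print(meets_threshold("ATGCATGCAT", "ATGCTTGGAT"))
```

Other conventions exist, for example dividing by the length of the shorter unaligned sequence, and they can give different percentages for the same pair.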

All of the aforementioned definitions apply equally to amino acid sequences and
functional enzymes and can easily be transferred by a person skilled in the art.

Isoenzymes are understood to be enzymes which have the same or a similar
substrate specificity and/or catalytic activity but a different primary structure.

In a first embodiment, this invention is directed to nucleic acid sequences that
encode a PDAT. This includes sequences that encode biologically active
PDATs as well as sequences that are to be used as probes, vectors for
transformation or cloning intermediates. The PDAT encoding sequence may
encode a complete or partial sequence depending upon the intended use. All or
a portion of the genomic sequence, cDNA sequence, precursor PDAT or
mature PDAT is intended.

Further included is a nucleotide sequence selected from the group consisting of
sequences set forth in SEQ ID No. 1, 3, 4, 10, 11, 19, 21, 23, 24, 29 or 30 or a
portion, derivate, allele or homolog thereof. The invention pertains to a partial
nucleotide sequence corresponding to a full-length nucleotide sequence selected
from the group consisting of sequences set forth in SEQ ID No. 5, 7, 9, 12, 25,
26, 28 or 31 or a portion, derivate, allele or homolog thereof. Moreover, a
nucleotide sequence comprising a nucleotide sequence which is at least 40%
homologous to a nucleotide sequence selected from the group consisting of
those sequences set forth in SEQ ID No. 1, 3, 4, 5, 7, 9, 10, 11, 12, 19, 21, 23,
24, 25, 26, 28, 29, 30 or 31 is contemplated within the scope of the invention.

The instant invention pertains to a gene construct comprising a said nucleotide
sequence of the instant invention which is operably linked to a heterologous
nucleic acid.

The term operably linked means a serial organisation, e.g. of a promoter, coding
sequence, terminator and/or further regulatory elements, whereby each element
can fulfill its original function during expression of the nucleotide sequence.
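
The serial organisation described above can be pictured as an ordered record of elements. The following Python sketch is a data-structure illustration only; the class name, field names and placeholder sequences are hypothetical and do not correspond to any construct disclosed in the specification.

```python
from dataclasses import dataclass


@dataclass
class GeneConstruct:
    """Hypothetical record of the elements of a construct, kept in serial order."""

    promoter: str
    coding_sequence: str
    terminator: str

    def serial_order(self) -> list[tuple[str, str]]:
        """Return (element name, sequence) pairs in the order promoter, coding sequence, terminator."""
        return [
            ("promoter", self.promoter),
            ("coding sequence", self.coding_sequence),
            ("terminator", self.terminator),
        ]

    def assemble(self) -> str:
        """Concatenate the element sequences in their serial order."""
        return "".join(sequence for _, sequence in self.serial_order())


# Placeholder strings standing in for real elements.
construct = GeneConstruct(promoter="TATAAA", coding_sequence="ATGGCTTAA", terminator="AATAAA")
print([name for name, _ in construct.serial_order()])
print(construct.assemble())
```

Further regulatory elements, which the text also mentions, could be added as extra fields; their position relative to the other elements is left open here.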

Further, a vector comprising a said nucleotide sequence of the instant
invention is contemplated in the instant invention. This includes also an
expression vector as well as a vector further comprising a selectable marker
gene and/or nucleotide sequences for the replication in a host cell and/or the
integration into the genome of the host cell.

In a different aspect, this invention relates to a method for producing a PDAT in
a host cell or progeny thereof, including genetically engineered oil seeds, yeast
and moulds or any other oil accumulating organism, via the expression of a
construct in the cell. Cells containing a PDAT as a result of the production of
the PDAT encoding sequence are also contemplated within the scope of the
invention.

Further, the invention pertains to a transgenic cell or organism containing a said
nucleotide sequence and/or a said gene construct and/or a said vector. The
object of the instant invention is further a transgenic cell or organism which is
a eucaryotic cell or organism. Preferably, the transgenic cell or organism is a
yeast cell or a plant cell or a plant. The instant invention further pertains to said
transgenic cell or organism having an altered biosynthetic pathway for the
production of triacylglycerol. A transgenic cell or organism having an altered oil
content is also contemplated within the scope of this invention.

Further, the invention pertains to a transgenic cell or organism wherein the activity
of PDAT is altered in said cell or organism. This altered activity of PDAT is
characterized by an alteration in gene expression, catalytic activity and/or
regulation of activity of the enzyme. Moreover, a transgenic cell or organism is
included in the instant invention, wherein the altered biosynthetic pathway for
the production of triacylglycerol is characterized by the prevention of
accumulation of undesirable fatty acids in the membrane lipids.

In a different embodiment, this invention also relates to methods of using a
DNA sequence encoding a PDAT for increasing the oil-content within a cell.

Another aspect of the invention relates to the accommodation of high amounts
of uncommon fatty acids in the triacylglycerol produced within a cell, by
introducing a DNA sequence producing a PDAT that specifically removes these
fatty acids from the membrane lipids of the cell and channels them into
triacylglycerol. Plant cells having such a modification are also contemplated
herein.

Further, the invention pertains to a process for the production of triacylglycerol,
comprising growing a said transgenic cell or organism under conditions
whereby the said nucleotide sequence is expressed and whereby the said
transgenic cells comprise a said enzyme catalysing the transfer of fatty acids
from phospholipids to diacylglycerol, forming triacylglycerol.

Moreover, triacylglycerols produced by the aforementioned process are
included in the scope of the instant invention.

A further object of the instant invention is the use of an instant nucleotide
sequence and/or said enzyme for the production of triacylglycerol and/or
triacylglycerols with uncommon fatty acids. The use of said instant nucleotide
sequence and/or said enzyme of the instant invention for the transformation
of any cell or organism, in order to be expressed in this cell or organism and
result in an altered, preferably increased, oil content of this cell or organism, is
also contemplated within the scope of the instant invention.

A PDAT of this invention includes any sequence of amino acids, such as a
protein, polypeptide or peptide fragment obtainable from a microorganism,
animal or plant source that demonstrates the ability to catalyse the production
of triacylglycerol from a phospholipid and diacylglycerol under enzyme reactive
conditions. By "enzyme reactive conditions" is meant that any necessary
conditions are available in an environment (e.g., such factors as temperature,
pH, lack of inhibiting substances) which will permit the enzyme to function.

Other PDATs are obtainable from the specific sequences provided herein.
Furthermore, it will be apparent that one can obtain natural and synthetic
PDATs, including modified amino acid sequences and starting materials for
synthetic-protein modelling, from the exemplified PDATs and from PDATs which
are obtained through the use of such exemplified sequences. Modified amino
acid sequences include sequences that have been mutated, truncated,
increased and the like, whether such sequences were partially or wholly
synthesised. Sequences that are actually purified from plant preparations, or
are identical or encode identical proteins thereto, regardless of the method
used to obtain the protein or sequence, are equally considered naturally
derived.

Further, the nucleic acid probes (DNA and RNA) of the present invention can
be used to screen and recover "homologous" or "related" PDATs from a variety
of plant and microbial sources.

Further, it is also apparent that a person skilled in the art can, with the
information provided in this application, in any organism identify a PDAT
activity, purify an enzyme with this activity and thereby identify a
"non-homologous" nucleic acid sequence encoding such an enzyme.

The present invention can be essentially characterized by the following
aspects:

1. Use of a PDAT gene (genomic clone or cDNA) for transformation.

2. Use of a DNA molecule according to item 1 wherein said DNA is used for
transformation of any organism in order to be expressed in this organism
and result in an active recombinant PDAT enzyme in order to increase oil
content of the organism.

3. Use of a DNA molecule of item 1 wherein said DNA is used for
transformation of any organism in order to prevent the accumulation of
undesirable fatty acids in the membrane lipids.

4. Use according to item 1, wherein said PDAT gene is used for transforming
transgenic oil accumulating organisms engineered to produce any
uncommon fatty acid which is harmful if present in high amounts in
membrane lipids, such as medium chain fatty acids, hydroxylated fatty
acids, epoxygenated fatty acids and acetylenic fatty acids.

5. Use according to item 1, wherein said PDAT gene is used for transforming
organisms, and wherein said organisms are crossed with other oil
accumulating organisms engineered to produce any uncommon fatty acid
which is harmful if present in high amounts in membrane lipids, comprising
medium chain fatty acids, hydroxylated fatty acids, epoxygenated fatty
acids and acetylenic fatty acids.

6. Use according to item 1, wherein the enzyme encoded by said PDAT gene
or cDNA is a PDAT with distinct acyl specificity.

7. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 30% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 40% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 60% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 80% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

11. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from plants or contains nucleotide sequences coding for an amino
acid sequence 40% or more identical to the amino acid sequence of PDAT
from Arabidopsis thaliana or to the protein encoded by the full-length
counterpart of the partial Zea mays, Lycopersicon esculentum, or
Neurospora crassa cDNA clones.

12. Transgenic oil accumulating organisms comprising, in their genome, a
PDAT gene transferred by recombinant DNA technology or somatic
hybridization.

13. Transgenic oil accumulating organisms according to item 12 comprising, in
their genome, a PDAT gene having specificity for substrates with a
particular uncommon fatty acid and the gene for said uncommon fatty acid.

14. Transgenic organisms according to item 12 or 13 which are selected from
the group consisting of fungi, plants and animals.

15. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants.

16. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a storage organ specific promoter.

17. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a seed promoter.

18. Oils from organisms according to items 12-17.

19. A method for altering acyl specificity of a PDAT by alteration of the
nucleotide sequence of a naturally occurring encoding gene and, as a
consequence of this alteration, creating a gene encoding an enzyme
with novel acyl specificity.

20. A protein encoded by a DNA molecule according to item 1 or a functional
fragment thereof.

21. A protein of item 20 designated phospholipid:diacylglycerol acyltransferase.

22. A protein of item 21 which has a distinct acyl specificity.

23. A protein of item 13 having the amino acid sequence as set forth in SEQ.
ID. NO. 2, 13, 14 or 15 (and the proteins encoded by the full-length or partial
genes set forth in SEQ. ID. NO. 1, 3, 4, 5, 7, 9, 10, 11 or 12) or an amino
acid sequence with at least 30% homology to said amino acid sequence.

24. A protein of item 23 isolated from Saccharomyces cerevisiae.
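
The identity thresholds named in items 7 to 11 and 23 (30%, 40%, 60% and 80%) presuppose some way of scoring two aligned sequences. The text does not state which alignment program or which definition of identity is intended, so the following Python sketch is only one possible reading. It assumes that the two sequences have already been aligned (gaps written as "-") and counts identical residues over the columns in which both sequences carry a residue. The function names and the toy peptides are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where both sequences have a residue.

    Assumption: inputs are pre-aligned, equal in length, and use '-' for gaps.
    The choice of denominator (ungapped columns) is also an assumption; other
    definitions, such as dividing by the shorter sequence length, exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns that contain a gap
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Made-up toy peptides, for illustration only.
query = "MKT-LLGA"
subject = "MKSALL-A"
print(round(percent_identity(query, subject), 1))
print(meets_threshold(query, subject, 40.0))
```

In practice the alignment itself would come from a dedicated tool, and the result depends on the scoring parameters chosen there; this sketch covers only the final counting step.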



General methods:

Yeast strains and plasmids. The wild type yeast strains used were either
FY1679 (MATa his3-Δ200 leu2-Δ1 trp1-Δ6 ura3-52) or W303-1A (MATa ADE2-1
can1-100 his3-11,15 leu2-3,112 trp1-1 ura3-1) (7). The YNR008w::KanMX2
disruption strain FVKT004-04C(AL), which is congenic to FY1679, was
obtained from the Euroscarf collection (8). A 2751 bp fragment containing the
YNR008w gene with 583 bp of 5' and 183 bp of 3' flanking DNA was amplified
from W303-1A genomic DNA using Taq polymerase with
5'-TCTCCATCTTCTGCAAAACCT-3' and 5'-CCTGTCAAAAACCTTCTCCTC-3' as
primers. The resulting PCR product was purified by agarose gel electrophoresis
and cloned into the EcoRV site of pBluescript (pBluescript-pdat). For
complementation experiments, the cloned fragment was released from
pBluescript by HindIII-SacI digestion and then cloned between the HindIII and
SacI sites of pFL39 (9), thus generating pUS1. For overexpression of the PDAT
gene, a 2202 bp EcoRI fragment from the pBluescript plasmid, which contains
only 24 bp of 5' flanking DNA, was cloned into the BamHI site of the GAL1-TPK2
expression vector pJN92 (12), thus generating pUS4.

Microsomal preparations. Microsomes from developing seeds of sunflower
(Helianthus annuus), Ricinus communis and Crepis palaestina were prepared
using the procedure of Stobart and Stymne (11). To obtain yeast microsomes,
1 g of yeast cells (fresh weight) was re-suspended in 8 ml of ice-cold buffer (20
mM Tris-Cl, pH 7.9, 10 mM MgCl2, 1 mM EDTA, 5% (v/v) glycerol, 1 mM DTT,
0.3 M ammonium sulfate) in a 12 ml glass tube. To this tube, 4 ml of glass
beads (diameter 0.45-0.5 mm) were added, and the tube was then heavily
shaken (3 x 60 s) in an MSK cell homogenizer (B. Braun Melsungen AG,
Germany). The homogenized suspension was centrifuged at 20,000 x g for 15
min at 6°C and the resulting supernatant was again centrifuged at 100,000 x g
for 2 h at 6°C. The 100,000 x g pellet was resuspended in 0.1 M potassium
phosphate (pH 7.2), and stored at -80°C. It is subsequently referred to as the
crude yeast microsomal fraction.

Lipid substrates. Radio-labeled ricinoleic (12-hydroxy-9-octadecenoic) and
vernolic (12,13-epoxy-9-octadecenoic) acids were synthesized enzymatically
from [1-14C]oleic acid and [1-14C]linoleic acid, respectively, by incubation with
microsomal preparations from seeds of Ricinus communis and Crepis
palaestina, respectively (12). The synthesis of phosphatidylcholines (PC) or
phosphatidylethanolamines (PE) with 14C-labeled acyl groups in the sn-2
position was performed using either enzymatic (13) or synthetic (14) acylation
of [14C]oleic, [14C]ricinoleic, or [14C]vernolic acid. Dioleoyl-PC that was labeled
in the sn-1 position was synthesized from sn-1-[14C]oleoyl-lyso-PC and
unlabeled oleic acid as described in (14). Sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG
was synthesized from PC by the action of phospholipase C type XI from B.
cereus (Sigma Chemical Co.) as described in (15). Monovernoloyl- and
divernoloyl-DAG were synthesized from TAG extracted from seeds of
Euphorbia lagascae, using the TAG-lipase (Rhizopus arrhizus, Sigma Chemical
Co.) as previously described (16). Monoricinoleoyl-TAG was synthesized
according to the same method using TAG extracted from castor bean.

Lipid analysis. The total lipid composition of yeast was determined from cells
harvested from a 40 ml liquid culture, broken in a glass-bead shaker and
extracted into chloroform as described by Bligh and Dyer (17), and then
separated by thin layer chromatography in hexane/diethylether/acetic acid
(80:20:1) using pre-coated silica gel 60 plates (Merck). The lipid areas were
located by brief exposure to I2 vapors and identified by means of appropriate
standards. Polar lipids, sterol-esters and triacylglycerols, as well as the
remaining minor lipid classes, referred to as other lipids, were excised from the
plates. Fatty acid methylesters were prepared by heating the dry excised
material at 85 °C for 60 min in 2% (v/v) sulfuric acid in dry methanol. The
methyl esters were extracted with hexane and analyzed by GLC through a 50 m
x 0.32 mm CP-Wax58-CB fused-silica column (Chrompack), with
methylheptadecanoic acid as an internal standard. The fatty acid content of
each fraction was quantified and used to calculate the relative amount of each
lipid class. In order to determine the total lipid content, 3 ml aliquots from yeast
cultures were harvested by centrifugation and the resulting pellets were washed
with distilled water and lyophilized. The weight of the dried cells was
determined and the fatty acid content was quantified by GLC analysis after
conversion to methylesters as described above. The lipid content was then
calculated as nmol fatty acid (FA) per mg dry weight yeast.
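
The two calculations described at the end of this paragraph (relative amount of each lipid class, and lipid content per mg dry weight) can be sketched in Python as follows. This is a minimal illustration, not the procedure actually used: it assumes that every fatty acid methyl ester peak has the same molar detector response as the internal standard, which the text does not state, and all function names and numbers are hypothetical.

```python
def fatty_acid_nmol(peak_area: float, standard_area: float,
                    standard_nmol: float) -> float:
    """Amount of one fatty acid, scaled from the internal-standard peak.

    Assumption: equal molar response for analyte and internal standard.
    """
    if standard_area <= 0:
        raise ValueError("internal standard peak area must be positive")
    return peak_area / standard_area * standard_nmol


def lipid_content_nmol_per_mg(peak_areas, standard_area: float,
                              standard_nmol: float,
                              dry_weight_mg: float) -> float:
    """Total fatty acid (nmol) per mg dry weight of yeast."""
    if dry_weight_mg <= 0:
        raise ValueError("dry weight must be positive")
    total = sum(fatty_acid_nmol(a, standard_area, standard_nmol)
                for a in peak_areas)
    return total / dry_weight_mg


def class_percentages(nmol_by_class: dict) -> dict:
    """Relative amount of each lipid class as percent of total fatty acid."""
    total = sum(nmol_by_class.values())
    if total <= 0:
        raise ValueError("total fatty acid amount must be positive")
    return {name: 100.0 * value / total
            for name, value in nmol_by_class.items()}


# Hypothetical numbers, for illustration only.
content = lipid_content_nmol_per_mg([200.0, 300.0], standard_area=100.0,
                                    standard_nmol=50.0, dry_weight_mg=5.0)
shares = class_percentages({"TAG": 30.0, "polar lipids": 60.0,
                            "sterol esters": 10.0})
print(content, shares["TAG"])
```

A real analysis would normally apply per-compound response factors; the equal-response shortcut is used here only to keep the arithmetic visible.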

Enzyme assays. Aliquots of crude microsomal fractions (corresponding to
10 nmol of microsomal PC) from developing plant seeds or yeast cells were
lyophilized overnight. 14C-Labeled substrate lipids dissolved in benzene were
then added to the dried microsomes. The benzene was evaporated under a
stream of N2, leaving the lipids in direct contact with the membranes, and 0.1
ml of 50 mM potassium phosphate (pH 7.2) was added. The suspension was
thoroughly mixed and incubated at 30°C for the time period indicated, up to 90
min. Lipids were extracted from the reaction mixture using chloroform and
separated by thin layer chromatography in hexane/diethylether/acetic acid
(35:70:1.5) using silica gel 60 plates (Merck). The radioactive lipids were
visualized and quantified on the plates by electronic autoradiography (Instant
Imager, Packard, US).

Yeast cultivation. Yeast cells were grown at 28°C on a rotary shaker in
liquid YPD medium (1% yeast extract, 2% peptone, 2% glucose), synthetic
medium (18) containing 2% (v/v) glycerol and 2% (v/v) ethanol, or minimal
medium (19) containing 16 g/l of glycerol.

The instant invention is further characterized by the following examples which
are not limiting:

Acyl-CoA-independent synthesis of TAG by oil seed microsomes. A large
number of unusual fatty acids can be found in oil seeds (20). Many of these
fatty acids, such as ricinoleic (21) and vernolic acids (22), are synthesized
using phosphatidylcholine (PC) with oleoyl or linoleoyl groups esterified to the
sn-2 position, respectively, as the immediate precursor. However, even though
PC can be a substrate for unusual fatty acid synthesis and is the major
membrane lipid in seeds, unusual fatty acids are rarely found in the
membranes. Instead, they are mainly incorporated into the TAG. A mechanism
for efficient and selective transfer of these unusual acyl groups from PC into
TAG must therefore exist in oil seeds that accumulate such unusual fatty acids.
This transfer reaction was biochemically characterized in seeds from castor
bean (Ricinus communis) and Crepis palaestina, plants which accumulate high
levels of ricinoleic and vernolic acid, respectively, and sunflower (Helianthus
annuus), a plant which has only common fatty acids in its seed oil. Crude
microsomal fractions from developing seeds were incubated with PC having
14C-labeled oleoyl, ricinoleoyl or vernoloyl groups at the sn-2 position. After the
incubation, lipids were extracted and analyzed by thin layer chromatography.
We found that the amount of radioactivity that was incorporated into the neutral
lipid fraction increased linearly over a period of 4 hours (data not shown). The
distribution of [14C]acyl groups within the neutral lipid fraction was analyzed
after 80 min (Fig. 1). Interestingly, the amount and distribution of radioactivity
between different neutral lipids were strongly dependent both on the plant
species and on the type of [14C]acyl chain. Thus, sunflower microsomes
incorporated most of the label into DAG, regardless of the type of [14C]acyl
group. In contrast, R. communis microsomes preferentially incorporated
[14C]ricinoleoyl and [14C]vernoloyl groups into TAG, while [14C]oleoyl groups
were mostly found in DAG. C. palaestina microsomes, finally, incorporated only
[14C]vernoloyl groups into TAG, with [14C]ricinoleoyl groups being found mostly
as free fatty acids, and [14C]oleoyl groups in DAG. This shows that the high in
vivo levels of ricinoleic acid and vernolic acid in the TAG pool of R. communis
and C. palaestina, respectively, can be explained by an efficient and selective
transfer of the corresponding acyl groups from PC to TAG in these organisms.

The in vitro synthesis of triacylglycerols in microsomal preparations of
developing castor bean is summarized in Table 1.

PDAT: a novel enzyme that catalyzes acyl-CoA independent synthesis of
TAG. It was investigated whether DAG could serve both as an acyl donor and as
an acyl acceptor in the reactions catalyzed by the oil seed microsomes.
Therefore, unlabeled divernoloyl-DAG was incubated with either
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG or sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC in the
presence of R. communis microsomes. The synthesis of TAG molecules
containing both [14C]ricinoleoyl and vernoloyl groups was 5-fold higher when
[14C]ricinoleoyl-PC served as acyl donor as compared to [14C]ricinoleoyl-DAG
(Fig. 1B). These data strongly suggest that PC is the immediate acyl donor and
DAG the acyl acceptor in the acyl-CoA-independent formation of TAG by oil
seed microsomes. Therefore, this reaction is catalyzed by a new enzyme which
we call phospholipid:diacylglycerol acyltransferase (PDAT).

PDAT activity in yeast microsomes. Wild type yeast cells were cultivated
under conditions where TAG synthesis is induced. Microsomal membranes
were prepared from these cells and incubated with sn-2-[14C]ricinoleoyl-PC
and DAG, and the 14C-labeled products formed were analyzed. The PC-derived
[14C]ricinoleoyl groups within the neutral lipid fraction were mainly found in free
fatty acids or TAG, and the amount of TAG synthesized was
dependent on the amount of DAG that was added to the reaction (Fig. 2). The in
vitro synthesis of TAG containing both ricinoleoyl and vernoloyl groups, a TAG
species not present in vivo, from exogenously added sn-2-[14C]ricinoleoyl-PC and
unlabelled vernoloyl-DAG (Fig. 2, lane 3) clearly demonstrates the existence of
an acyl-CoA-independent synthesis of TAG involving PC and DAG as
substrates in yeast microsomal membranes. Consequently, TAG synthesis in
yeast can be catalyzed by an enzyme similar to the PDAT found in plants.



The PDAT encoding gene in yeast.

A gene in the yeast genome (YNR008w) is known, but nothing is known about
the function of YNR008w, except that the gene is not essential for growth under
normal circumstances. Microsomal membranes were prepared from the yeast
strain FVKT004-04C(AL) (8) in which this gene with unknown function had been
disrupted. PDAT activity in the microsomes was assayed using PC with
radiolabelled fatty acids at the sn-2 position. The activity was found to be
completely absent in the disruption strain (Fig. 2 lane 4). Significantly, the
activity could be partially restored by the presence of YNR008w on the single
copy plasmid pUS1 (Fig. 2 lane 5). Moreover, acyl groups of
phosphatidylethanolamine (PE) were efficiently incorporated into TAG by
microsomes from the wild type strain whereas no incorporation occurred from
this substrate in the mutant strain (data not shown). This shows that YNR008w
encodes a yeast PDAT which catalyzes the transfer of an acyl group from the
sn-2 position of phospholipids to DAG, thus forming TAG. It should be noted
that no cholesterol esters were formed from radioactive PC even in incubations
with added ergosterols, nor was the amount of radioactive free fatty acids
formed from PC affected by disruption of the YNR008w gene (data not shown).
This demonstrates that yeast PDAT does not have cholesterol ester synthesising
or phospholipase activities.






Increased TAG content in yeast cells that overexpress PDAT. The effect
of overexpressing the PDAT-encoding gene was studied by transforming a wild
type yeast strain with the pUS4 plasmid in which the gene is expressed from
the galactose-induced GAL1-TPK2 promoter. Cells containing the empty
expression vector were used as a control. The cells were grown in synthetic
glycerol-ethanol medium, and expression of the gene was induced after either 2
hours (early log phase) or 25 hours (stationary phase) by the addition of
galactose. The cells were then incubated for another 21 hours, after which they
were harvested and assays were performed. We found that overexpression of
PDAT had no significant effect on the growth rate as determined by the optical
density. However, the total lipid content, measured as pmol fatty acids per mg
yeast dry weight, was 47% (log phase) or 29% (stationary phase) higher in the
PDAT overexpressing strain than in the control. Furthermore, the polar lipid and
sterol ester content was unaffected by overexpression of PDAT. Instead, the
elevated lipid content in these cells is entirely due to an increased TAG content
(Fig. 3A,B). Thus, the amount of TAG was increased 2-fold in PDAT
overexpressing early log phase cells and by 40% in stationary phase cells. It is
interesting to note that a significant increase in the TAG content was achieved
by overexpressing PDAT even under conditions (i.e. in stationary phase) where
DAGAT is induced and thus contributes significantly to TAG synthesis. In vitro
PDAT activity assayed in microsomes from the PDAT overexpressing strain
was 7-fold higher than in the control strain, a finding which is consistent with the
increased levels of TAG that we observed in vivo (Fig. 3C). These results
clearly demonstrate the potential use of the PDAT gene in increasing the oil
content in transgenic organisms.

Substrate specificity of yeast PDAT. The substrate specificity of yeast
PDAT was analyzed using microsomes prepared from the PDAT
overexpressing strain (see Fig. 4). The rate of TAG synthesis, under the conditions
given in Figure 4 with di-oleoyl-PC as the acyl donor, was 0.15 nmol per min
and mg protein. With both oleoyl groups of PC labeled it was possible, under
the given assay conditions, to detect the transfer of 11 pmol/min of [14C]oleoyl
chain into TAG and the formation of 15 pmol/min of lyso-PC. In microsomes
from the PDAT-deficient strain, no TAG at all and only trace amounts of
lyso-PC were detected, strongly suggesting that yeast PDAT catalyses the formation
of equimolar amounts of TAG and lyso-PC when supplied with PC and DAG as
substrates. The fact that somewhat more lyso-PC than TAG is formed can be
explained by the presence of a phospholipase in yeast microsomes, which
produces lyso-PC and unesterified fatty acids from PC.
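
The stoichiometric argument above can be made explicit with a small calculation. The Python sketch below uses the two rates reported in the text (11 pmol/min of [14C]oleoyl transferred into TAG and 15 pmol/min of lyso-PC formed) and, on the assumption that PDAT forms TAG and lyso-PC in a strict 1:1 ratio, attributes the remaining lyso-PC to other activities such as the phospholipase mentioned. The function name is hypothetical.

```python
def excess_lyso_pc(tag_rate: float, lyso_pc_rate: float) -> float:
    """Lyso-PC formation rate not accounted for by 1:1 PDAT stoichiometry.

    Assumption: each acyl group transferred into TAG by PDAT leaves exactly
    one lyso-PC molecule behind, so any surplus comes from other enzymes.
    """
    return lyso_pc_rate - tag_rate


tag_rate = 11.0      # pmol/min of [14C]oleoyl chain found in TAG (from the text)
lyso_pc_rate = 15.0  # pmol/min of lyso-PC formed (from the text)

surplus = excess_lyso_pc(tag_rate, lyso_pc_rate)
pdat_share = tag_rate / lyso_pc_rate  # fraction of lyso-PC explained by PDAT
print(surplus, round(pdat_share, 2))
```

Under this reading roughly three quarters of the lyso-PC is accounted for by PDAT, which matches the qualitative statement that only somewhat more lyso-PC than TAG is formed.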

The specificity of yeast PDAT for different acyl group positions was
investigated by incubating the microsomes with di-oleoyl-PC carrying a
[14C]acyl group either at the sn-1 position (Fig. 4A bar 2) or the sn-2 position
(Fig. 4A bar 3). We found that the major 14C-labeled product formed in the
former case was lyso-PC, and in the latter case TAG. We conclude that yeast
PDAT has a specificity for the transfer of acyl groups from the sn-2 position of
the phospholipid to DAG, thus forming sn-1-lyso-PC and TAG. Under the given
assay conditions, trace amounts of 14C-labelled DAG are formed from the sn-1
labeled PC by the reversible action of a CDP-choline:choline
phosphotransferase. This labeled DAG can then be further converted into TAG
by the PDAT activity. It is therefore not possible to distinguish whether the
minor amounts of labeled TAG that are formed in the presence of di-oleoyl-PC
carrying a [14C]acyl group in the sn-1 position are synthesized directly from the
sn-1-labeled PC by a PDAT that can also act on the sn-1 position, or whether the PC is first
converted to sn-1-labeled DAG and then acylated by a PDAT with strict
selectivity for the transfer of acyl groups at the sn-2 position of PC. Taken
together, this shows that the PDAT encoded by YNR008w catalyses an acyl
transfer from the sn-2 position of PC to DAG, thus causing the formation of
TAG and lyso-PC.

The substrate specificity of yeast PDAT was further analyzed with respect
to the headgroup of the acyl donor, the acyl group transferred and the acyl
chains of the acceptor DAG molecule. The two major membrane lipids of S.
cerevisiae are PC and PE, and as shown in Fig. 4B (bars 1 and 2), dioleoyl-PE
is nearly 4-fold more efficient than dioleoyl-PC as acyl donor in the
PDAT-catalyzed reaction. Moreover, the rate of acyl transfer is strongly dependent on
the type of acyl group that is transferred. Thus, a ricinoleoyl group at the sn-2
position of PC is 2.5 times more efficiently transferred into TAG than an oleoyl
group in the same position (Fig. 4B bars 1 and 3). In contrast, yeast PDAT has
no preference for the transfer of vernoloyl groups over oleoyl groups (Fig. 4B
bars 1 and 4). The acyl chain of the acceptor DAG molecule also affects the
efficiency of the reaction. Thus, DAG with a ricinoleoyl or a vernoloyl group is a
more efficient acyl acceptor than dioleoyl-DAG (Fig. 4B bars 1, 5 and 6). Taken
together, these results clearly show that the efficiency of the PDAT-catalyzed
acyl transfer is strongly dependent on the properties of the substrate lipids.

PDAT genes. Nucleotide and amino acid sequences of several PDAT
genes are given as SEQ ID NO. 1 through 15. Further provisional and/or partial
sequences are given as SEQ ID NO. 16 through 20 and 21 through 31,
respectively. One of the Arabidopsis genomic sequences (SEQ ID NO. 4)
identified an Arabidopsis EST cDNA clone, T04806. This cDNA clone was fully
characterised and the nucleotide sequence is given as SEQ ID NO. 5. Based
on the sequence homology of the T04806 cDNA and the Arabidopsis thaliana
genomic DNA sequence (SEQ ID NO. 4) it is apparent that an additional A is
present at position 417 in the cDNA clone (data not shown). Excluding this
nucleotide would give the amino acid sequence depicted in SEQ ID NO. 12.

Increased TAG content in seeds of Arabidopsis thaliana that express the
yeast PDAT. For the expression of the yeast PDAT gene in Arabidopsis
thaliana, an EcoRI fragment from the pBluescript-PDAT was cloned together
with the napin promoter (25) into the vector pGPTV-KAN (26). A plasmid
(pGNapPDAT) having the yeast PDAT gene in the correct orientation was
identified and transformed into Agrobacterium tumefaciens. These bacteria
were used to transform Arabidopsis thaliana Columbia (C-24) plants using the
root transformation method (27). Plants transformed with an empty vector were
used as controls.

First generation seeds (T1) were harvested and germinated on kanamycin
containing medium. Second generation seeds (T2) were pooled from individual
plants and their fatty acid contents analysed by quantification of their methyl
esters by gas liquid chromatography after methylation of the seeds with 2%
sulphuric acid in methanol at 85 °C for 1.5 hours. Quantification was done with
heptadecanoic acid methyl esters as internal standard.

From the transformation with pGNapPDAT one T1 plant (26-14) gave rise to
seven T2 plants, of which 3 plants yielded seeds with statistically (in a mean
difference two-sided test) higher oil content than seeds from T2 plants
generated from T1 plant 32-4 transformed with an empty vector (Table 2).

Description of Figures

FIG. 1.

Metabolism of 14C-labeled PC into the neutral lipid fraction by plant
microsomes. (A) Microsomes from developing seeds of sunflower, R.
communis and C. palaestina were incubated for 80 min at 30°C with PC (8
nmol) having oleic acid in its sn-1 position, and either 14C-labeled oleic,
ricinoleic or vernolic acid in its sn-2 position. Radioactivity incorporated in TAG
(open bars), DAG (solid bars), and unesterified fatty acids (hatched bars) was
quantified using thin layer chromatography followed by electronic
autoradiography, and is shown as percentage of added labeled substrate. (B)
Synthesis in vitro of TAG carrying two vernoloyl and one [14C]ricinoleoyl group
by microsomes from R. communis. The substrates added were unlabeled
divernoloyl-DAG (5 nmol), together with either sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG
(0.4 nmol, 7700 dpm/nmol) or sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (0.4
nmol, 7700 dpm/nmol). The microsomes were incubated with the substrates for
30 min at 30°C, after which samples were removed for lipid analysis as
described in the section "General methods". The data shown are the average of
two experiments.

FIG. 2.

PDAT activity in yeast microsomes, as visualized by autoradiogram of neutral
lipid products separated on TLC. Microsomal membranes (10 nmol of PC) from
the wild type yeast strain FY1679 (lanes 1-3), a congenic yeast strain
(FVKT004-04C(AL)) that is disrupted for YNR008w (lane 4) or the same
disruption strain transformed with the plasmid pUS1, containing the YNR008w
gene behind its native promoter (lane 5), were assayed for PDAT activity. As
substrates, we used 2 nmol sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC together with
either 5 nmol of dioleoyl-DAG (lanes 2, 4 and 5) or rac-oleoyl-vernoloyl-DAG
(lane 3). The enzymatic assay and lipid analysis were performed as described in
Materials and Methods. The cells were precultured for 20 h in liquid YPD
medium, harvested and re-suspended in an equal volume of minimal medium
(19) containing 16 g/l glycerol. The cells were then grown for an additional 24 h
prior to being harvested. Selection for the plasmid was maintained by growing
the transformed cells in synthetic medium lacking uracil (18). Abbreviations:
1-OH-TAG, monoricinoleoyl-TAG; 1-OH-1-ep-TAG,
monoricinoleoyl-monovernoloyl-TAG; OH-FA, unesterified ricinoleic acid.

FIG. 3.

Lipid content (A, B) and PDAT activity (C) in PDAT overexpressing yeast cells.
The PDAT gene in the plasmid pUS4 was overexpressed from the
galactose-induced GAL1-TPK2 promoter in the wild type strain W303-1A (7). Its
expression was induced after (A) 2 hours or (B) 25 hours of growth by the
addition of 2% final concentration (w/v) of galactose. The cells were then
incubated for another 22 hours before being harvested. The amount of lipids of
the harvested cells was determined by GLC analysis of its fatty acid contents
and is presented as pmol fatty acids per mg dry weight in TAG (open
bar), polar lipids (hatched bar), sterol esters (solid bar) and other lipids (striped
bar). The data shown are the mean values of results with three independent
yeast cultures. (C) In vitro synthesis of TAG by microsomes prepared from
yeast cells containing either the empty vector (vector) or the PDAT plasmid
(+ PDAT). The cells were grown as in Fig. 3A. The substrate lipids dioleoyl-DAG
(2.5 nmol) and sn-1-oleoyl-sn-2-[14C]oleoyl-PC (2 nmol) were added to aliquots
of microsomes (10 nmol PC), which were then incubated for 10 min at 28 °C.
The amount of label incorporated into TAG was quantified by electronic
autoradiography. The results shown are the mean values of two experiments.

FIG. 4.

Substrate specificity of yeast PDAT. The PDAT activity was assayed by
incubating aliquots of lyophilized microsomes (10 nmol PC) with substrate lipids
at 30°C for 10 min (panel A) or 90 min (panel B). Unlabeled DAG (2.5 nmol)
was used as substrate together with different labeled phospholipids, as shown
in the figure. (A) Sn-position specificity of yeast PDAT regarding the acyl donor
substrate. Dioleoyl-DAG together with either
sn-1-[14C]oleoyl-sn-2-[14C]oleoyl-PC (di-[14C]-PC),
sn-1-[14C]oleoyl-sn-2-oleoyl-PC (sn1-[14C]-PC) or
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (sn2-[14C]-PC). (B) Specificity of yeast PDAT regarding
phospholipid headgroup and the acyl composition of the phospholipid as well
as of the diacylglycerol. Dioleoyl-DAG together with either
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (oleoyl-PC),
sn-1-oleoyl-sn-2-[14C]oleoyl-PE (oleoyl-PE),
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (ricinoleoyl-PC) or
sn-1-oleoyl-sn-2-[14C]vernoloyl-PC (vernoloyl-PC). In the experiments presented in the 2 bars to
the far right, monoricinoleoyl-DAG (ricinoleoyl-DAG) or monovernoloyl-DAG
(vernoloyl-DAG) was used together with sn-1-oleoyl-sn-2-[14C]oleoyl-PC. The
label that was incorporated into TAG (solid bars) and lyso-PC (LPC, open bars)
was quantified by electronic autoradiography. The results shown are the mean
values of two experiments. The microsomes used were from W303-1A cells
overexpressing the PDAT gene from the GAL1-TPK2 promoter, as described in
Fig. 3. The expression was induced at early stationary phase and the cells were
harvested after an additional 24 h.

TAB. 1:

In vitro synthesis of triacylglycerols in microsomal preparations of developing
castor bean. Aliquots of microsomes (20 nmol PC) were lyophilised and
substrate lipids were added in benzene solution: (A) 0.4 nmol [14C]-DAG (7760
dpm/nmol) and where indicated 1.6 nmol unlabelled DAG; (B) 0.4 nmol [14C]-DAG
(7760 dpm/nmol) and 5 nmol unlabelled di-ricinoleoyl-PC and (C) 0.25
nmol [14C]-PC (4000 dpm/nmol) and 5 nmol unlabelled DAG. The benzene was
evaporated by N2 and 0.1 ml of 50 mM potassium phosphate was added,
thoroughly mixed and incubated at 30 °C for (A) 20 min; (B) and (C) 30 min.
Assays were terminated by extraction of the lipids in chloroform. The lipids
were then separated by thin layer chromatography on silica gel 60 plates
(Merck; Darmstadt, Germany) in hexane/diethyl ether/acetic acid 35:70:1.5. The
radioactive lipids were visualised and the radioactivity quantified on the plate by
electronic autoradiography (Instant Imager, Packard, US). Results are
presented as mean values of two experiments.

Radioactivity in different triacylglycerol (TAG) species formed. Abbreviations
used: 1-OH-, mono-ricinoleoyl-; 2-OH-, di-ricinoleoyl-; 3-OH-, tri-ricinoleoyl-;
1-OH-1-ver-, mono-ricinoleoyl-monovernoleoyl-; 1-OH-2-ver-, mono-ricinoleoyl-
divernoleoyl-. Radiolabelled DAG and PC were prepared enzymatically. The
radiolabelled ricinoleoyl group is attached at the sn-2-position of the lipid and
the unlabelled oleoyl group at the sn-1-position. Unlabelled DAG with vernoleoyl- or
ricinoleoyl chains were prepared by the action of TAG lipase (6) on oil of
Euphorbia lagascae or castor bean, respectively. Synthetic di-ricinoleoyl-PC
was kindly provided by Metapontum Agribios (Italy).
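
The specific radioactivities quoted in the legend (for example 7760 dpm/nmol for [14C]-DAG) allow counted radioactivity to be converted into an amount of lipid formed. The following is only a minimal sketch of that arithmetic in Python. It assumes that every product molecule carries exactly one labelled group at the substrate's specific activity and ignores dilution by endogenous unlabelled lipid; the function name and the dpm reading of 1552 are hypothetical and are not data from the experiments.

```python
def synthesis_rate_pmol_per_min(dpm_in_product, specific_activity_dpm_per_nmol, minutes):
    """Convert radioactivity recovered in a product into a rate in pmol/min.

    Assumes one labelled group per product molecule and no isotopic dilution.
    """
    nmol_formed = dpm_in_product / specific_activity_dpm_per_nmol
    return nmol_formed * 1000.0 / minutes  # 1 nmol = 1000 pmol


# Hypothetical reading: 1552 dpm found in TAG after the 20 min incubation of (A),
# with the [14C]-DAG specific activity of 7760 dpm/nmol given in the legend.
rate = synthesis_rate_pmol_per_min(1552, 7760, 20)
print(f"{rate:.1f} pmol/min")
```

With real data the dpm value would come from the electronic autoradiography of the TAG spot, and the incubation time from the relevant assay condition.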



TAB. 2: 

Total fatty acids per mg of T2 seeds pooled from individual Arabidopsis thaliana
plants transformed with the yeast PDAT gene under the control of the napin promoter
(26-14) or transformed with empty vector (32-4).

* = statistical difference between control plants and PDAT transformed plants
in a mean difference two-sided test at α = 5.



BASF-NAE 3377/99 PCT, 12.09.2000

Description of the SEQ ID:

SEQ ID NO. 1: Genomic DNA sequence and suggested amino acid sequence of
the Saccharomyces cerevisiae PDAT gene, YNR008w, with GenBank accession
numbers Z71623 and Y13139, and with nucleotide ID number 1302481.

SEQ ID NO. 2: The amino acid sequence of the suggested open reading frame
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 3: Genomic DNA sequence of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AB006704.

SEQ ID NO. 5: Nucleotide sequence of the Arabidopsis thaliana cDNA clone with
GenBank accession number T04806, and nucleotide ID number 315966.

SEQ ID NO. 6: Predicted amino acid sequence of the Arabidopsis thaliana cDNA
clone with GenBank accession number T04806.

SEQ ID NO. 7: Nucleotide and amino acid sequence of the Zea mays EST clone
with GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 8: Predicted amino acid sequence of the Zea mays EST clone with
GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 9: DNA sequence of part of the Neurospora crassa EST clone
W07G1, with GenBank accession number AI398644, and nucleotide ID number
4241729.

SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC004557.

SEQ ID NO. 11: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC003027.

SEQ ID NO. 12: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

SEQ ID NO. 13: Amino acid sequence of the Schizosaccharomyces pombe
putative open reading frame CAA22887 of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 14: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAC80628 derived from the Arabidopsis thaliana locus with
GenBank accession number AC004557.

SEQ ID NO. 15: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAD10668 derived from the Arabidopsis thaliana locus with
GenBank accession number AC003027.

20 

Further provisional and/or partial sequences are defined through the following
SEQ IDs:

SEQ ID NO. 16: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae.

SEQ ID NO. 17: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AC004557).

SEQ ID NO. 18: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AB006704).

SEQ ID NO. 19: The corresponding genomic DNA sequence and amino acid
sequence of the yeast ORF YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 20: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae derived from the corresponding genomic DNA
sequence.

SEQ ID NO. 21: Genomic DNA sequence of the Saccharomyces cerevisiae
PDAT gene, YNR008w, GenBank nucleotide ID number 1302481, and the
suggested YNR008w amino acid sequence.

SEQ ID NO. 22: The suggested amino acid sequence of the yeast gene
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 23: Genomic DNA sequence of the Schizosaccharomyces pombe
gene SPBC776.14.

SEQ ID NO. 24: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AB006704.

SEQ ID NO. 25: Nucleotide sequence and the corresponding amino acid
sequence of the Arabidopsis thaliana EST clone with GenBank accession
number T04806, and ID number 315966.

SEQ ID NO. 26: Nucleotide and amino acid sequence of the Zea mays cDNA
clone with GenBank ID number 4388167.

SEQ ID NO. 27: Amino acid sequence of the Zea mays cDNA clone with
GenBank ID number 4388167.

SEQ ID NO. 28: DNA sequence of part of the Neurospora crassa cDNA clone
W07G1, ID number 4241729.

SEQ ID NO. 29: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC004557.

SEQ ID NO. 30: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC003027.

SEQ ID NO. 31: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

Claims

1. An enzyme catalysing in an acyl-CoA-independent reaction the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway
for the production of triacylglycerol.

2. An enzyme according to claim 1, comprising an amino acid sequence as
set forth in SEQ ID No. 2 or a functional fragment, derivate, allele, homolog
or isoenzyme thereof.

3. An enzyme according to claims 1 or 2 designated as
phospholipid:diacylglycerol acyltransferase (PDAT).

4. An enzyme according to claims 1 to 3, comprising an amino acid sequence
as set forth in SEQ ID No. 16, 20 or 22 or a functional fragment, derivate,
allele, homolog or isoenzyme thereof.

5. An enzyme according to claims 1 to 4, comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID No.
6, 8, 13, 14, 15, 17, 18, 25 or 27 or a functional fragment, derivate, allele,
homolog or isoenzyme thereof.

6. An enzyme according to claims 1 to 5, comprising an amino acid sequence
encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set
forth in SEQ ID No. 1, 3, 4, 5, 7, 9, 10, 11, 12, 19, 21, 23, 24, 25, 26, 28,
29, 30 or 31 or a functional fragment, derivate, allele, homolog or
isoenzyme of the enzyme encoding amino acid sequence.

7. A nucleotide sequence encoding an enzyme catalysing in an acyl-CoA-
independent reaction the transfer of fatty acids from phospholipids to
diacylglycerol in the biosynthetic pathway for the production of
triacylglycerol.

8. A nucleotide sequence according to claim 7 encoding an enzyme
designated as phospholipid:diacylglycerol acyltransferase (PDAT).

9. A nucleotide sequence according to claims 7 or 8, selected from the group
consisting of sequences as set forth in SEQ ID No. 1, 3, 4, 10, 11, 19, 21,
23, 24, 29 or 30 or a portion, derivate, allele or homolog thereof.

10. A partial nucleotide sequence corresponding to a full-length nucleotide
sequence according to claims 7 to 9, selected from the group consisting of
sequences as set forth in SEQ ID No. 5, 7, 9, 12, 25, 26, 28 or 31 or a
portion, derivate, allele or homolog thereof.

11. A nucleotide sequence according to claims 7 to 10, comprising a nucleotide
sequence which is at least 40% homologous to a nucleotide sequence
selected from the group consisting of those sequences set forth in SEQ ID
No. 1, 3, 4, 5, 7, 9, 10, 11, 12, 19, 21, 23, 24, 25, 26, 28, 29, 30 or 31.

12. A gene construct comprising a nucleotide sequence according to claims 7
to 11 operably linked to a heterologous nucleic acid.

13. A vector comprising a nucleotide sequence according to claims 7 to 11 or a
gene construct according to claim 12.

14. A vector according to claim 13, which is an expression vector.

15. A vector according to claims 13 or 14, further comprising a selectable
marker gene and/or nucleotide sequences for the replication in a host cell
or the integration into the genome of the host cell.

16. A transgenic cell or organism containing a nucleotide sequence according
to claims 7 to 11 and/or a gene construct according to claim 12 and/or a
vector according to claims 13 to 15.

17. A transgenic cell or organism according to claim 16 which is a eucaryotic
cell or organism.

18. A transgenic cell or organism according to claims 16 or 17 which is a yeast
cell or a plant cell or a plant.

19. A transgenic cell or organism according to claims 16 to 18 having an
altered biosynthetic pathway for the production of triacylglycerol.

20. A transgenic cell or organism according to claims 16 to 19 having an
altered oil content.

21. A transgenic cell or organism according to claims 16 to 20 wherein the
activity of PDAT is altered.

22. A transgenic cell or organism according to claims 16 to 21 wherein the
altered activity of PDAT is characterized by an alteration in gene
expression, catalytic activity and/or regulation of activity of the enzyme.

23. A transgenic cell or organism according to claims 16 to 22 wherein the
altered biosynthetic pathway for the production of triacylglycerol is
characterized by the prevention of accumulation of undesirable fatty acids
in the membrane lipids.

24. A process for the production of triacylglycerol, comprising growing a
transgenic cell or organism according to claims 16 to 23 under conditions
whereby the said nucleotide sequence according to claims 7 to 11 is
expressed.

25. Triacylglycerols produced by a process according to claim 24.

26. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the production of triacylglycerol
and/or triacylglycerols with uncommon fatty acids.

27. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the transformation of any cell or
organism in order to be expressed in this cell or organism and result in an
altered, preferably increased oil content of this cell or organism.
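
Claim 11 refers to a nucleotide sequence that is "at least 40% homologous" to a listed sequence. The claims do not state how that percentage is to be computed, so the following Python sketch is only one possible reading: percent identity over the compared columns of two sequences that have already been aligned. The function name, the treatment of gap columns and the choice of example are assumptions. The two fragments are nucleotides 517-549 of SEQ ID NO. 1 and 436-468 of SEQ ID NO. 3 as they appear in the sequence listing, placed side by side without gaps.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical positions between two pre-aligned sequences.

    Columns in which either sequence has a gap ('-') are skipped; this
    choice of denominator is an assumption, not taken from the claims.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Short ungapped stretch from the S. cerevisiae and S. pombe sequences.
cerevisiae = "catcctgttgtaatggttcctggtgtcatttct"
pombe = "catcctgttattatggttcctggtgttatcagc"
pid = percent_identity(cerevisiae, pombe)
print(f"{pid:.1f}% identity over {len(cerevisiae)} positions")
```

A comparison of full-length sequences would first need an alignment step, which this sketch does not perform.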
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Abstract of the Disclosure 



The present invention relates to the isolation, identification and characterization
of nucleotide sequences encoding an enzyme catalysing the transfer of fatty
acids from phospholipids to diacylglycerol in the biosynthetic pathway for the
production of triacylglycerol, to the said enzymes and a process for the
production of triacylglycerols.



Fig. 1:

Fig. 2:

Fig. 4: [axis label: Lipid synthesis (pmol/min)]

Tab. 2:

T1 plant   T2 plant number   nmol fatty acids per mg seed   standard deviation

           1                 1277                           ±11 (n=2)
                             1261                           ±63 (n=3)
           5                 1369                           ±17 (n=3)
           6                 1312                           ±53 (n=4)
           7                 1197                           ±54 (n=5)
           8                 1240                           ±78 (n=4)
           9                 1283                           ±54 (n=5)
           10                1381                           ±35 (n=5)

           1                 1444                           ±110 (n=4)
           2                 1617*                          ±109 (n=4)
           3                 1374                           ±37 (n=2)
           5                 1562*                          ±70 (n=4)
           6                 1393                           ±77 (n=4)
           7                 1433                           ±98 (n=4)
           8                 1581*                          ±82 (n=4)
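
The legend of Tab. 2 mentions a two-sided test of a mean difference between control and PDAT transformed plants but does not spell the test out. As a rough illustration only, the Python sketch below compares the per-plant means of the two blocks of rows with Welch's t statistic. It assumes that the first block (8 plants) is the empty-vector line and the second block (7 plants) the PDAT line, which the table itself does not label, and it uses only the plant means, not the within-plant standard deviations. It is not a reconstruction of the test used for the asterisks.

```python
import math
import statistics

# Per-plant means (nmol fatty acids per mg seed) copied from Tab. 2.
# Which block is control and which is PDAT is an assumption.
first_block = [1277, 1261, 1369, 1312, 1197, 1240, 1283, 1381]
second_block = [1444, 1617, 1374, 1562, 1393, 1433, 1581]


def welch_t(a, b):
    """Welch's t statistic for the difference mean(b) - mean(a)."""
    var_a = statistics.variance(a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(b)
    standard_error = math.sqrt(var_a / len(a) + var_b / len(b))
    return (statistics.mean(b) - statistics.mean(a)) / standard_error


mean_first = statistics.mean(first_block)
mean_second = statistics.mean(second_block)
t_value = welch_t(first_block, second_block)
print(f"block means: {mean_first:.0f} vs {mean_second:.0f}; Welch t = {t_value:.2f}")
```

Turning the t statistic into a p-value would additionally require the Welch-Satterthwaite degrees of freedom and a t-distribution, which are left out to keep the sketch dependency-free.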
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26474 JUL 0 2 2002 ^ ' 0093/000003 

PATENT & TRADEMARK OFFICE 

We (I), the undersigned inventor(s), hereby declare(s) that:
My residence, post office address and citizenship are as stated below next to my name,

We (I) believe that we are (I am) the original, first, and joint (sole) inventor(s) of the subject matter which is claimed and
for which a patent is sought on the invention entitled

NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE PRODUCTION OF
TRIACYLGLYCEROL AND RECOMBINANT DNA MOLECULES ENCODING THESE ENZYMES



the specification of which

[ ] is attached hereto.

[x] was filed on September 28, 2001
    Application Serial No. 09/937,779
    and amended on

[x] was filed as PCT international application
    Number PCT/EP/00/02701
    on March 28, 2000
    and was amended under PCT Article 19
    on (if applicable).

We (I) hereby state that we (I) have reviewed and understand the contents of the above-identified specification, including 
the claims, as amended by any amendment referred to above. 



We (I) acknowledge the duty to disclose information known to be material to the patentability of this application as 
defined in Section 1.56 of Title 37 Code of Federal Regulations. 



We (I) hereby claim foreign priority benefits under 35 U.S.C. § 119(a)-(d) or § 365(b) of any foreign application(s) for
patent or inventor's certificate, or § 365(a) of any PCT International application which designated at least one country other 
than the United States, listed below and have also identified below, by checking the box, any foreign application for patent 
or inventor's certificate, or PCT International application having a filing date before that of the application on which priority 
is claimed. Prior Foreign Application(s) 



Application No.   Country                    Day/Month/Year     Priority Claimed

99106656 A        Europe                     01 April 1999      [x] Yes [ ] No
99111321.8        Europe                     10 June 1999       [x] Yes [ ] No
60/180687         United States of America   07 February 2000   [x] Yes [ ] No

Anders Dahlqvist
NAME OF SOLE OR FIRST INVENTOR

Signature of Inventor
Date 29.08.2001

Residence:
Hemmansvagen 2
24466 Furulund
Sweden
Citizen of: Sweden
Post Office Address: same as residence

NAME OF SECOND JOINT INVENTOR

Signature of Inventor
Date 29.08.2001

Marit Lenman
NAME OF THIRD JOINT INVENTOR

Signature of Inventor
Date 29.08.2001

Residence:
Liljegatan 7b
75324 Uppsala
Sweden
Citizen of: Sweden
Post Office Address: same as residence

Residence:
Revingegatan 13a
22359 Lund
Sweden
Citizen of: Sweden
Post Office Address: same as residence

Antoni Banas
NAME OF FOURTH JOINT INVENTOR

Residence:
Wiolinowa 14
08110 Siedlce
Poland
Citizen of: Sweden
Post Office Address: same as residence

Date 29.08.2001

Hans Ronne
NAME OF FIFTH JOINT INVENTOR

Signature of Inventor
Date 29.08.2001

Residence:
Dirigentvagen 169
75654 Uppsala
Sweden
Citizen of: Sweden
Post Office Address: same as residence

We (I) hereby claim the benefit under Title 35, United States Code, § 119(e) of any United States provisional
application(s) listed below. 



(Application Number) (Filing Date) 



(Application Number) (Filing Date) 



We (I) hereby claim the benefit under 35 U.S.C. § 120 of any United States application(s), or § 365(c) of any PCT
International application designating the United States, listed below and, insofar as the subject matter of each of the claims
of this application is not disclosed in the prior United States or PCT International application in the manner provided by the
first paragraph of 35 U.S.C. § 112, I acknowledge the duty to disclose information which is material to patentability as defined
in 37 CFR § 1.56 which became available between the filing date of the prior application and the national or PCT International
filing date of this application.

Application Serial No.   Filing Date   Status (pending, patented, abandoned)



And we (I) hereby appoint HERBERT B. KEIL, Registration Number 18,967; and RUSSEL E. WEINKAUF,
Registration Number 18,495; the address of both being Messrs. Keil & Weinkauf, 1101 Connecticut Ave., N.W., Washington,
D.C. 20036 (telephone 202-559-0100), our attorneys, with full power of substitution and revocation, to prosecute this
application, to make alterations and amendments therein, to sign the drawings, to receive the patent, and to transact all
business in the Patent Office connected therewith.

We (I) declare that all statements made herein of our (my) own knowledge are true and that all statements made on
information and belief are believed to be true; and further that these statements were made with the knowledge that willful
false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the
United States Code and that such willful false statements may jeopardize the validity of the application or any patent issuing
thereon.

SEQUENCE LISTING

<110> Dahlquist, Anders
      Stahl, Ulf
      Lenman, Marit
      Banas, Antoni
      Ronne, Hans
      Stymne, Sten

<120> A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE PRODUCTION OF
      TRIACYLGLYCEROL AND RECOMBINANT DNA MOLECULES ENCODING THESE ENZYMES

<130> BASF-NAE-3377-99-Sept-2000

<140> US 09/937,779

<150> PCT/EP 00/02701
<151> 2000-03-28

<160> 31

<170> PatentIn Ver. 2.1

<210> 1
<211> 1986
<212> DNA
<213> Saccharomyces cerevisiae

<220>
<221> CDS
<222> (1)..(1983)

<400> 1

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 48
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag agc aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tcc aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt agc ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc gct tat cat gtt cat aat agc gat agc gac ttg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tca ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

ctc cca caa ggt ata agt tcg ttt att gat gat att cag gct ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

tac tcc aca tct tct tta gat gat ctc agt gaa aat ttt gcc gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa ctc tta cgt gat tat aat atc gag gcc aaa cat cct gtt gta 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

atg gtt cct ggt gtc att tct acg gga att gaa agc tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa cgg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg cta cgt gca gca cag ggc ttc gaa tca act gat tat ttc atc 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gct gcg tat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta gat cta gaa aga cgc gat agg tac ttt acg aag cta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa atc gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga cat tct atg ggt tct cag att atc ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gag gct gaa ggc cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tca ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gct cca aag gca gtt cca gct cta att agt ggt gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

att caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg cta cca aag gga gaa gag gtc att tgg ggg gat atg aag tca tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

tca gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg agc gat gct ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gcc att aac atg aca tta tcg ata tca cct gaa tgg ctc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac tcg ttc ggc tat tcc aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag cta cac cac aag cac tgg tcg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa gct ccc cac atg aaa atc tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tcc tct gct ctg aat ttg acc atc gac tac gaa agc aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc ctc acc gag ggg gac gga acc gtt ccg ctc gtg gcg cat tca 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tgt cac aaa tgg gcc cag ggt gct tca ccg tac aac cct gcc gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gga gca aaa agc gcc gaa cac gta gac atc ctc ggc agc 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac atc ttg aaa att gca agc ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

ctc gtc gag cca cgc caa ttg tct aat ttg agc cag tgg gtt tct cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

atg ccc ttc cca atg taa 1986
Met Pro Phe Pro Met
660
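
The listing for SEQ ID NO. 1 gives a total length of 1986 nucleotides, a coding region of (1)..(1983) and a 661-residue translation. As a consistency check, the Python sketch below translates the first row of codons with the standard genetic code and confirms the length arithmetic. The helper names are illustrative assumptions, and the use of the standard code is itself an assumption, though it is the usual one for nuclear genes of Saccharomyces cerevisiae.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of SEQ ID NO. 1 (nucleotides 1-48).
first_row = "atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct"
peptide = translate(first_row)

# Length bookkeeping: the CDS (1)..(1983) plus one stop codon gives 1986 nt.
cds_end, total_length = 1983, 1986
residues = cds_end // 3
print(peptide, residues, total_length - cds_end)
```

The one-letter peptide printed for the first row corresponds to the three-letter line Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser, and the residue count agrees with the length given for SEQ ID NO. 2.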

<210> 2 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
15 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

Asn His He His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 40 45 

He Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

He Phe He Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

Leu Pro Gin Gly He Ser Ser Phe He Asp Asp He Gin Ala Gly Asn 
130 135 140 



Tyr Ser Thr Ser 
• l 145 

Lys Gin Leu Leu 



Met Val Pro Gly 
180 

Gly Asp Asp Glu 
195 

Gly Ser Phe Tyr 
210 

Leu Lys His Val 

225 

Phe Thr Leu Arg 



Ala Gly Tyr Trp 
260 

Gly Tyr Glu Pro 
275 

Ala Tyr Leu Asp 

290 

Glu Gin He Glu 

305 

He Gly His Ser 



Val Glu Ala Glu 
340 

Asn Glu His He 
355 

Ala Pro Lys Ala 
370 

He Gin Leu Asn 
385 

Arg He Glu Arg 



Met Leu Pro Lys 
420 

Ser Glu Asp Ala 
435 

Arg Phe Glu Arg 
450 



Ser Leu Asp Asp 
150 

Arg Asp Tyr Asn 
165 

Val He Ser Thr 



Cys Asp Ser Ser 
200 

Met Leu Arg Thr 
215 

Met Leu Asp Pro 

230 

Ala Ala Gin Gly 
245 

He Trp Asn Lys 



Asn Lys Met Thr 
280 

Leu Glu Arg Arg 
295 

Leu Phe His Gin 
310 

Met Gly Ser Gin 
325 

Gly Pro Leu Tyr 



Asp Ser Phe He 
360 

Val Pro Ala Leu 
375 

Thr Leu Ala Met 
390 

Val Lys Met Leu 
405 

Gly Glu Glu Val 



Leu Asn Asn Asn 
440 

Asn Thr Ser Asp 

455 



Leu Ser Glu Asn 
155 

He Glu Ala Lys 
170 

Gly He Glu Ser 
185 

Ala His Phe Arg 



Met Val Met Asp 
220 

Glu Thr Gly Leu 
235 

Phe Glu Ser Thr 
250 

Val Phe Gin Asn 
265 

Ser Ala Ala Tyr 



Asp Arg Tyr Phe 
300 

Leu Ser Gly Glu 
315 

He He Phe Tyr 

330 

Gly Asn Gly Gly 
345 

Asn Ala Ala Gly 



He Ser Gly Glu 
380 

Tyr Gly Leu Glu 
395 

Gin Thr Trp Gly 
410 

He Trp Gly Asp 
425 

Thr Asp Thr Tyr 



Ala Phe Asn Lys 
460 



Phe Ala Val Gly 
160 

His Pro Val Val 
175 

Trp Gly Val He 
190 

Lys Arg Leu Trp 
205 

Lys Val Cys Trp 



Asp Pro Pro Asn 
240 

Asp Tyr Phe He 
255 

Leu Gly Val He 
270 

Asp Trp Arg Leu 
285 

Thr Lys Leu Lys 



Lys Val Cys Leu 
320 

Phe Met Lys Trp 
335 

Arg Gly Trp Val 
350 

Thr Leu Leu Gly 
365 

Met Lys Asp Thr 



Lys Phe Phe Ser 
400 

Gly He Pro Ser 
415 

Met Lys Ser Ser 
430 

Gly Asn Phe He 
445 

Asn Leu Thr Met 



6 



Lys Asp Ala lie Asn Met 
465 470 

Arg Arg Val His Glu Gin 
485 

Glu Leu Arg Lys Asn Glu 
500 

Glu Val Pro Leu Pro Glu 
515 

Gly Val Asn Asn Pro Thr 
530 

Asp Ser Ser Ala Leu Asn 
545 550 

Val Phe Leu Thr Glu Gly 
565 

Met Cys His Lys Trp Ala 
580 

lie Asn Val Thr lie Val 
595 

lie Arg Gly Gly Ala Lys 
610 

Ala Glu Leu Asn Asp Tyr 
625 630 

Leu Val Glu Pro Arg Gin 
645 

Met Pro Phe Pro Met 
660 

<210> 3 
<211> 2312 
<212> DNA 
<213> Schizosaccharomyces pombe 

<400> 3 

atggcgtctt ccaagaagag caaaactcat aagaaaaaga aagaagtcaa atctcctatc 60 
gacttaccaa attcaaagaa accaactcgc gctttgagtg agcaaccttc agcgtccgaa 120 
acacaatctg tttcaaataa atcaagaaaa tctaaatttg gaaaaagatt gaattttata 180 
ttgggcgcta ttttgggaat atgcggtgct ttttttttcg ctgttggaga cgacaatgct 240 
gttttcgacc ctgctacgtt agataaattt gggaatatgc taggctcttc agacttgttt 3 00 
gatgacatta aaggatattt atcttataat gtgtttaagg atgcaccttt tactacggac 360 



Thr Leu 



Tyr Ser 



Leu His 

Ala Pro 
520 

Glu Arg 
535 

Leu Thr 

Asp Gly 

Gin Gly 

Glu Met 
600 

Ser Ala 
615 

lie Leu 
Leu Ser 



Ser lie 

Phe Gly 
490 

His Lys 
505 

His Met 

Ala Tyr 

lie Asp 

Thr Val 
570 

Ala Ser 
585 

Lys His 
Glu His 



Lys lie 



Asn Leu 
650 



Ser Pro 
475 

Tyr Ser 

His Trp 

Lys lie 

Val Tyr 
540 

Tyr Glu 

555 

Pro Leu 
Pro Tyr 



Gin Pro 



Val Asp 
620 

Ala Ser 
635 

Ser Gin 



Glu Trp 

Lys Asn 

Ser Asn 
510 

Tyr Cys 
525 

Lys Glu 

Ser Lys 

Val Ala 

Asn Pro 
590 

Asp Arg 
605 

lie Leu 
Gly Asn 
Trp Val 



Leu Gin 
480 

Glu Glu 
495 

Pro Met 

lie Tyr 

Glu Asp 

Gin Pro 
560 

His Ser 
575 

Ala Gly 

Phe Asp 

Gly Ser 

Gly Asp 
640 

Ser Gin 

655 



aagccttcgc agtctcctag cggaaatgaa 
ggatatcgaa gtgaccatcc tgttattatg 
agttggtcgt ttaataattg ctcgattcct 
tctatgctga aggcaatgtt ccttgacaag 
aaaaaaaccg gcttggatcc gaagggaatt 
gctgattttt ttatcacggg ctattggatt 
attggttatg agcctaataa catgttaagt 
aatttagagg aacgtgataa atatttttca 
attgtacata agaaaaaggt agtgttgatt 
tattttttta agtgggttga agctgagggc 
gatcatattg aagcatttat aaatgtgagt 
tttgaataga tatcgggatc tttgattgga 
ggtgaaatga aagatacagg tattgtaatt 
taaccgtttt aagctcaatt gaatcagttt 
gatttgttac taatttactg tttagtttgg 
caaaaataca aatgtgctct actttttcta 
ctatgggagg agttagttct atgcttccta 
gttgggtaag aaatatgtgc tgttaatttt 
atcaaacaaa tttttccaat ggtgcaatta 
acgatgaatt tgacatagat gatgcattac 
ttaaagtcat gctagcgaaa aattattccc 
taaaaaataa cgaaatgccg tctaaatgga 
tactaaatta tactaaccca aatagactag 
ttgcgttcac ggggtcggaa aaccaactga 
ggggcaacct gtcattgatt cctcggttaa 
atttatgttt caaacattct attaactgtt 
tggaacttta ccaatattag cccttggttt 
gtttaatcct gctaatacaa gtatcacaaa 
tgatctgaga ggaggacctc gctcggcaga 
aaatgtatgt tcattttacc ttacaaattt 



7 

gttcaagttg gtcttgatat gtacaatgag 420 
gttcctggtg ttatcagctc aggattagaa 480 
tactttagga aacgtctttg gggtagctgg 540 
caatgctggc ttgaacattt aatgcttgat 600 
aagctgcgag cagctcaggg gtttgaagca 660 
tggagtaaag taattgaaaa ccttgctgca 720 
gcttcttacg attggcggtt atcatatgca 780 
aagttaaaaa tgttcattga gtacagcaac 840 
tctcactcca tgggttcaca ggttacgtac 900 
tacggaaatg gtggaccgac ttgggttaat 960 
ctcgatggtt gtttgactac gtttctaact 1020 
gcacccaaaa cagtggcagc gcttttatcg 1080 
acattaaaca tgttaatatt taatttttgc 1140 
tcggtctatg ggtaagcaat aaattgttga 1200 
aaaaattttt ttcccgttct gaggtatatt 1260 
acttttaata gagagccatg atggttcgca 1320 
aaggaggcga tgttgtatgg ggaaatgcca 1380 
ttattaatat ttaggctcca gatgatctta 1440 
ttcgatatag agaagacatt gataaggacc 1500 
aatttttaaa aaatgttaca gatgacgatt 1560 
acggtcttgc ttggactgaa aaagaagtgt 1620 
taaatccgct agaagtaaga acattaaagt 1680 
tcttccttat gctcctgata tgaaaattta 1740 
gagaggttat tattatacta ataatcctga 1800 
tgatggaaca aaagttgaaa atgtgagaga 1860 
ttattagggt attgttatgg atgatggtga 1920 
ggtgtgcaat aaagtttggc aaacaaaaag 1980 
ttatgaaatc aagcatgaac ctgctgcgtt 2040 
acacgtcgat atacttggac attcagagct 2100 
ctattactaa ctcttgaaat aaggaaatta 2160 



ttttaaaagt ttcatcaggc catggtgact 
agtacggaca taagttttgt agattgcaat 
atgagataaa tctcgataaa cctagaaatt 

<210> 4 
<211> 3685 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 4 

atgcccctta ttcatcggaa aaagccgacg 
gtggtgcacg atgaggattc gcaaaagaaa 
aaatcgaacg gaggagggaa gtggtcgtgc 
gtgtgtgtaa cctggtggtt tcttctcttc 
cagtatgtaa cggagcgaat cacgggtcct 
aaagaaggtc ttaaggcgaa acatcctgtt 
ctcgagcttt gggaaggcaa acaatgcgct 
ggaacttttg gtgaagtcta caaaaggtga 
gggatttgga ttggatctga tgagatcacg 
tttaattcca tgtttgtctg tcttactctt 
attttcttaa gagactattt ctgtatgtgt 
tggactattt ctgtttgatt gttaacttta 
gtcatcttat tgccaaatct gttgctagac 
ttcctttact gtcgttgcgt gtagatttag 
tatgttttgt tgggaataga gaagttctaa 
gatagaggac tgttgcttta ttattcaact 
tttgatcttt cagctcaatg tgcttttctc 
cgagtttatt cacatgtctt gaatttcgtc 
cctcccgacc ctgctatgga tatattaaaa 
acgatctgca tcttcttctt tcggctcagt 
aatgttattg ttgatggtaa cagtggtata 
ctttgattca ggcctctatg ttgggtggaa 
gatccagctg gtattagagt tcgagctgta 
cggtaccaaa ccgttatata tcagatatcc 2220 
taactaacta accgaacagg gaaataataa 2280 
aa 2312 



gagaaaccat cgacgccgcc atctgaagag 60 
ccacacgaat cttccaaatc ccaccataag 120 
atcgattctt gttgttggtt cattgggtgt 180 
ctttacaacg caatgcctgc gagcttccct 240 
ttgcctgacc cgcccggtgt taagctcaaa 300 
gtcttcattc ctgggattgt caccggtggg 360 
gatggtttat ttagaaaacg tttgtggggt 420 
gctcaacaat tctcactctt cctttatatt 480 
cacttgttgc ttcttcaaca tcactcaaac 540 
tacttttttt tttttttgat gtgaaacgct 600 
aaggtaagcg ttccaaggac gtaattggct 660 
ggatataaaa tagctgcctt ggaatttcaa 720 
atgccctaga gtccgttcat aacaagttac 780 
ctttgtgtag cgtataatga agtagtgttt 840 
ctacatctgt ggaaagtgtg ttcaggctgt 900 
atgtatatgt gtaattaaag ctagttcctt 960 
aatttttttc tcaatttcaa agtttcacat 1020 
catcctcgtt ctgttatcca gctttgaact 1080 
aaaaagtgtt ttgtgggttg catctttgtt 1140 
gttcatgttt ttgctatggt agagatgggc 1200 
gttgatagta tcttaactaa tcaattatct 1260 
cacatgtcac ttgacaatga aactgggttg 1320 
tcaggactcg tggctgctga ctactttgct 1380 



cctggctact ttgtctgggc agtgctgatt 
aaaaatatgt acatggctgc atatgactgg 
ttctcatcgt tctttctatt attctgttcc 
cttaaatatg tttcatgttg aattaatagg 
gtaatataga gttgatggtt tctaccaacg 
ccatgggggt cttgtatttt ctacatttta 
gtggcggggg tgggccagat tggtgtgcaa 
gaccatttct tggtgttcca aaagctgttg 
ttgcagttgc caggtattga atatctgctt 
gaactcaaag ttattctact aaatatcaat 
ctgacattgg ttgattattt ttgctgctta 
atgatgaatt gataattctt acgcattgct 
taacatttgt catactgtct tttggagggc 
ttccatgctt gcattcttta ccaattagcg 
tgcatctatg gtcttttatt tcttcttaat 
gtcacttggt tccttaatat agaactttac 
aggattctta gacaccgata tatttagact 
acgcacatgg gactcaacaa tgtctatgtt 
gcttgattgg tcaccggaga aaggccacac 
aacttgtggt gaagcaggtg aaaacggagt 
gatgatatct tttgggaaag aagtagcaga 
ttttcgagta aggacatata aatcataata 
atatctgtac attttatctg gtgaagggtg 
cctgtcgtga cgtgtggaca gagtaccatg 
ctgagtataa ggtctacact gctggtgaag 
agatgatggc gcgtggtgcc gctcatttct 
ccaagtatca agatcccaaa tactggtcaa 
gattccaact gtatccttcg tcctgatgca 
gatatggttt tcagctcaaa gcttacaaag 
ctcagtaata ttgaggtgct aaagttgata 



gctaaccttg cacatattgg atatgaagag 1440 
cggctttcgt ttcagaacac agaggttctt 1500 
atgttacgtt tctttcttca ttacttaagg 1560 
tacgtgatca gactcttagc cgtatgaaaa 1620 
gtggaaaaaa agcagttata gttccgcatt 1680 
tgaagtgggt tgaggcacca gctcctctgg 1740 
agtatattaa ggcggtgatg aacattggtg 1800 
cagggctttt ctctgctgaa gcaaaggatg 1860 
atacttttga tgatcagaac cttggctctg 1920 
tctaataaca ttgctatatt atcgctgcaa 1980 
tgtaactgaa actctcttga gattagacaa 2040 
ctgtgatgac cagtttctta gcttcgacga 2100 
attgaatttt gctatggaaa gcgctggagc 2160 
ttattctgct tctttcaatt ttcttgtata 2220 
taaagactcg ttggattagt tgctctatta 2280 
tttcttcgaa aattgcagag cgattgcccc 2340 
tcagaccttg cagcatgtaa tgagaatgac 2400 
accgaaggga ggtgacacga tatggggcgg 2460 
ctgttgtggg aaaaagcaaa agaacaacga 2520 
ttccaagaaa agtcctgtta actatggaag 2580 
ggctgcgcca tctgagatta ataatattga 2640 
aaccttgtac attttgtgat tgtatgatga 2700 
ctgtcaaagg tcagagtatc ccaaatcaca 2760 
acatgggaat tgctgggatc aaagctatcg 2820 
ctatagatct actacattat gttgctccta 2880 
cttatggaat tgctgatgat ttggatgaca 2940 
atccgttaga gacaaagtaa gtgatttctt 3000 
ttatcagtct ttttgttttc ggtcttgttg 3060 
ctgtttctga gcctttctca aaaaggcttg 3120 
catgtgactc ttgcttataa atcctccgtt 3180 



tggtttgttc tgctttttca gattaccgaa tgctcctgag atggaaatct actcattata 3240 

cggagtgggg ataccaacgg aacgagcata cgtatacaag cttaaccagt ctcccgacag 3300 

ttgcatcccc tttcagatat tcacttctgc tcacgaggag gacgaagata gctgtctgaa 3360 

agcaggagtt tacaatgtgg atggggatga aacagtaccc gtcctaagtg ccgggtacat 3420 

gtgtgcaaaa gcgtggcgtg gcaagacaag attcaaccct tccggaatca agacttatat 3480 

aagagaatac aatcactctc cgccggctaa cctgttggaa gggcgcggga cgcagagtgg 3540 

tgcccatgtt gatatcatgg gaaactttgc tttgatcgaa gatatcatga gggttgccgc 3600 

cggaggtaac gggtctgata taggacatga ccaggtccac tctggcatat ttgaatggtc 3660 

ggagcgtatt gacctgaagc tgtga 3685 

<210> 5 
<211> 2427 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> unsure 
<222> 1..2427 

<223> n = a or g or c or t/u 
<400> 5 



agaaacagct 


ctttgtctct 


ctcgactgat 


ctaacaatcc 


ctaatctgtg 


ttctaaattc 


60 


ctggacgaga 


tttgacaaag 


tccgtatagc 


ttaacctggt 


ttaatttcaa 


gtgacagata 


120 


tgccccttat 


tcatcggaaa 


aagccgacgg 


agaaaccatc 


gacgccgcca 


tctgaagagg 


180 


tggtgcacga 


tgaggattcg 


caaaagaaac 


cacacgaatc 


ttccaaatcc 


caccataaga 


240 


aatcgaacgg 


aggagggaag 


tggtcgtgca 


tcgattcttg 


ttgttggttc 


attgggtgtg 


300 


tgtgtgtaac 


ctggtggttt 


cttctcttcc 


tttacaacgc 


aatgcctgcg 


agcttccctc 


360 


agtatgtaac 


ggagcgaatc 


acgggtcctt 


tgcctgaccc 


gcccggtgtt 


aagctcaaaa 


420 


aaagaaggtc 


ttaaggcgaa 


acatcctgtt 


gtcttcattc 


ctgggattgt 


caccggtggg 


480 


ctcgagcttt 


gggaaggcaa 


acaatgcgct 


gatggtttat 


ttagaaaacg 


tttgtggggt 


540 


ggaacttttg 


gtgaagtcta 


caaaaggcct 


ctatgttggg 


tggaacacat 


gtcacttgac 


600 


aatgaaactg 


ggttggatcc 


agctggtatt 


agagttcgag 


ctgtatcagg 


actcgtggct 


660 


gctgactact 


ttgctcctgg 


ctactttgtc 


tgggcagtgc 


tgattgctaa 


ccttgcacat 


720 


attggatatg 


aagagaaaaa 


tatgtacatg 


gctgcatatg 


actggcggct 


ttcgtttcag 


780 


aacacagagg 


tacgtgatca 


gactcttagc 


cgtatgaaaa 


gtaatataga 


gttgatggtt 


840 


tctaccaacg 


gtggaaaaaa 


agcagttata 


gttccgcatt 


ccatgggggt 


cttgtatttt 


900 






ctacatttta tgaagtgggt tgaggcacca 
tggtgtgcaa agtatattaa ggcggtgatg 
aaagctgttg cagggctttt ctctgctgaa 
gccccaggat tcttagacac cgatatattt 
atgacacgca catgggactc aacaatgtct 
ggcgggcttg attggtcacc ggagaaaggc 
aacgaaactt gtggtgaagc aggtgaaaac 
ggaaggatga tatcttttgg gaaagaagta 
attgattttc gaggtgctgt caaaggtcag 
tggacagagt accatgacat gggaattgct 
tacactgctg gtgaagctat agatctacta 
ggtgccgctc atttctctta tggaattgct 
cccaaatact ggtcaaatcc gttagagaca 
tactcattat acggagtggg gataccaacg 
tctcccgaca gttgcatccc ctttcagata 
agctgtctga aagcaggagt ttacaatgtg 
gccgggtaca tgtgtgcaaa agcgtggcgt 
aagacttata taagagaata caatcactct 
acgcagagtg gtgcccatgt tgatatcatg 
agggttgccg ccggaggtaa cgggtctgat 
tttgaatggt cggagcgtat tgacctgaag 
ctgtcagctt atgtgaatcc aatactttga 
gtcatcatca tgatgctcaa ctcacaaaga 
ctcaatacct ctttaatatt cttattgaat 
acgataacac aaaacttgct gcngccatgt 
gggttaaaaa aaaaaaaaaa aaaaaaa 

<210> 6 
<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 
gctcctctgg 


gtggcggggg 


tgggccagat 


960 


aacattggtg 


gaccatttct 


tggtgttcca 


1020 


gcaaaggatg 


ttgcagttgc 


cagagcgatt 


1080 


agacttcaga 


ccttgcagca 


tgtaatgaga 


1140 


atgttaccga 


agggaggtga 


cacgatatgg 


1200 


cacacctgtt 


gtgggaaaaa 


gcaaaagaac 


1260 


ggagtttcca 


agaaaagtcc 


tgttaactat 


1320 


gcagaggctg 


cgccatctga 


gattaataat 


1380 


agtatcccaa 


atcacacctg 


tcgtgacgtg 


1440 


gggatcaaag 


ctatcgctga 


gtataaggtc 


1500 


cattatgttg 


ctcctaagat 


gatggcgcgt 


1560 


gatgatttgg 


atgacaccaa 


gtatcaagat 


1620 


aaattaccga 


atgctcctga 


gatggaaatc 


1680 


gaacgagcat 


acgtatacaa 


gcttaaccag 


1740 


ttcacttctg 


ctcacgagga 


ggacgaagat 


1800 


gatggggatg 


aaacagtacc 


cgtcctaagt 


1860 


ggcaagacaa 


gattcaaccc 


ttccggaatc 


1920 


ccgccggcta 


acctgttgga 


agggcgcggg 


1980 


ggaaactttg 


ctttgatcga 


agatatcatg 


2040 


ataggacatg 


accaggtcca 


ctctggcata 


2100 


ctgtgaatat 


catgatctct 


ttaagctgtc 


2160 


aagagagatc 


atcatcaatt 


catcatcatc 


2220 


agcctgagaa 


tgatactttg 


gtgcgaaatt 


2280 


gtaaattata 


caatcctatc 


taatgtttga 


2340 


ttgtttgtct 


tgtcaaaagc 


atcaatttgt 


2400 
2427 



<400> 6 
Met Pro Leu Ile 
1 

Pro Ser Glu Glu 
20 

Glu Ser Ser Lys 
35 

Ser Cys Ile Asp 
50 

Trp Trp Phe Leu 
65 

Gln Tyr Val Thr 



Val Lys Leu Lys 
100 

Ile Pro Gly Ile 
115 

Cys Ala Asp Gly 
130 

Glu Val Tyr Lys 
145 

Asn Glu Thr Gly 



Gly Leu Val Ala 
180 

Val Leu Ile Ala 
195 

Tyr Met Ala Ala 
210 

Arg Asp Gln Thr 
225 

Ser Thr Asn Gly 



Val Leu Tyr Phe 
260 

Leu Gly Gly Gly 
275 

Val Met Asn Ile 
290 

Gly Leu Phe Ser 
305 



His Arg Lys Lys 
5 

Val Val His Asp 



Ser His His Lys 
40 

Ser Cys Cys Trp 
55 

Leu Phe Leu Tyr 
70 

Glu Arg Ile Thr 

85 

Lys Glu Gly Leu 



Val Thr Gly Gly 
120 

Leu Phe Arg Lys 
135 

Arg Pro Leu Cys 
150 

Leu Asp Pro Ala 
165 

Ala Asp Tyr Phe 



Asn Leu Ala His 
200 

Tyr Asp Trp Arg 
215 

Leu Ser Arg Met 
230 

Gly Lys Lys Ala 
245 

Leu His Phe Met 



Gly Gly Pro Asp 
280 

Gly Gly Pro Phe 
295 

Ala Glu Ala Lys 
310 



Pro Thr Glu Lys 
10 

Glu Asp Ser Gln 
25 

Lys Ser Asn Gly 



Phe Ile Gly Cys 
60 

Asn Ala Met Pro 

75 

Gly Pro Leu Pro 
90 

Lys Ala Lys His 
105 

Leu Glu Leu Trp 



Arg Leu Trp Gly 
140 

Trp Val Glu His 
155 

Gly Ile Arg Val 
170 

Ala Pro Gly Tyr 
185 

Ile Gly Tyr Glu 



Leu Ser Phe Gln 
220 

Lys Ser Asn Ile 
235 

Val Ile Val Pro 

250 

Lys Trp Val Glu 
265 

Trp Cys Ala Lys 



Leu Gly Val Pro 
300 

Asp Val Ala Val 
315 



Pro Ser Thr Pro 
15 

Lys Lys Pro His 
30 

Gly Gly Lys Trp 
45 

Val Cys Val Thr 



Ala Ser Phe Pro 
80 

Asp Pro Pro Gly 
95 

Pro Val Val Phe 
110 

Glu Gly Lys Gln 
125 

Gly Thr Phe Gly 



Met Ser Leu Asp 
160 

Arg Ala Val Ser 
175 

Phe Val Trp Ala 
190 

Glu Lys Asn Met 
205 

Asn Thr Glu Val 



Glu Leu Met Val 
240 

His Ser Met Gly 
255 

Ala Pro Ala Pro 
270 

Tyr Ile Lys Ala 
285 

Lys Ala Val Ala 



Ala Arg Ala Ile 
320 
Ala Pro Gly Phe Leu 
325 

His Val Met Arg Met 
340 

Pro Lys Gly Gly Asp 

355 

Lys Gly His Thr Cys 
370 

Gly Glu Ala Gly Glu 
385 

Gly Arg Met Ile Ser 
405 

Glu Ile Asn Asn Ile 
420 

Pro Asn His Thr Cys 
435 

Ile Ala Gly Ile Lys 
450 

Glu Ala Ile Asp Leu 
465 

Gly Ala Ala His Phe 
485 

Lys Tyr Gln Asp Pro 
500 

Pro Asn Ala Pro Glu 
515 

Pro Thr Glu Arg Ala 
530 

Cys Ile Pro Phe Gln 
545 

Ser Cys Leu Lys Ala 
565 

Pro Val Leu Ser Ala 
580 

Thr Arg Phe Asn Pro 
595 

His Ser Pro Pro Ala 
610 



Asp Thr Asp Ile Phe Arg 
330 

Thr Arg Thr Trp Asp Ser 
345 

Thr Ile Trp Gly Gly Leu 
360 

Cys Gly Lys Lys Gln Lys 
375 

Asn Gly Val Ser Lys Lys 
390 395 

Phe Gly Lys Glu Val Ala 
410 

Asp Phe Arg Gly Ala Val 
425 

Arg Asp Val Trp Thr Glu 
440 

Ala Ile Ala Glu Tyr Lys 

455 

Leu His Tyr Val Ala Pro 
470 475 

Ser Tyr Gly Ile Ala Asp 
490 

Lys Tyr Trp Ser Asn Pro 
505 

Met Glu Ile Tyr Ser Leu 
520 

Tyr Val Tyr Lys Leu Asn 
535 

Ile Phe Thr Ser Ala His 
550 555 

Gly Val Tyr Asn Val Asp 
570 

Gly Tyr Met Cys Ala Lys 
585 

Ser Gly Ile Lys Thr Tyr 
600 

Asn Leu Leu Glu Gly Arg 
615 



Leu Gln Thr Leu Gln 
335 

Thr Met Ser Met Leu 
350 

Asp Trp Ser Pro Glu 
365 

Asn Asn Glu Thr Cys 
380 

Ser Pro Val Asn Tyr 
400 

Glu Ala Ala Pro Ser 
415 

Lys Gly Gln Ser Ile 
430 

Tyr His Asp Met Gly 
445 

Val Tyr Thr Ala Gly 
460 

Lys Met Met Ala Arg 
480 

Asp Leu Asp Asp Thr 
495 

Leu Glu Thr Lys Leu 
510 

Tyr Gly Val Gly Ile 
525 

Gln Ser Pro Asp Ser 
540 

Glu Glu Asp Glu Asp 
560 

Gly Asp Glu Thr Val 
575 

Ala Trp Arg Gly Lys 
590 

Ile Arg Glu Tyr Asn 
605 

Gly Thr Gln Ser Gly 
620 



Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile Glu Asp Ile Met 
625 630 635 640 

Arg Val Ala Ala Gly Gly Asn Gly Ser Asp Ile Gly His Asp Gln Val 
645 650 655 

His Ser Gly Ile Phe Glu Trp Ser Glu Arg Ile Asp Leu Lys Leu 
660 665 670 

<210> 7 
<211> 643 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1)..(402) 

<221> unsure 

<222> 1..643 

<223> n= a or g or c or t/u 

<400> 7 



cgg gag aaa ata gct gct ttg aag ggg ggt gtt tac tta gcc gat ggt 48 
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
1 5 10 15 

gat gaa act gtt cca gtt ctt agt gcg ggc tac atg tgt gcg aaa gga 96 
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

tgg cgt ggc aaa act cgt ttc agc cct gcc ggc agc aag act tac gtg 144 
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

aga gaa tac agc cat tcg cca ccc tct act ctc ctg gaa ggc agg ggc 192 
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

acc cag agc ggt gca cat gtt gat ata atg ggg aac ttt gct cta att 240 
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile 
65 70 75 80 

gag gac gtc atc aga ata gct gct ggg gca acc ggt gag gaa att ggt 288 
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly 
85 90 95 

ggc gat cag gtt tat tca gat ata ttc aag tgg tca gag aaa atc aaa 336 
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys 
100 105 110 

ttg aaa ttg taa cct atg gga agt taa aga agt gcc gac ccg ttt att 384 
Leu Lys Leu 
115 

gcg ttc caa agt gtc ctg cctgagtgca actctggatt ttgcttaaat 432 
attgtaattt ttcacgcttc attcgtccct ttgtcaaatt tacatttgac aggacgccaa 492 




tgcgatacga tgttgtaccg ctattttcag cattgtatat taaactgtac aggtgtaagt 552 

tgcatttgcc agctgaaatt gtgtagtcgt tttctttacg atttaatanc aagtggcgga 612 

gcagtgcccc aagcnaaaaa aaaaaaaaaa a 643 

<210> 8 
<211> 115 
<212> PRT 
<213> Zea mays 

<400> 8 

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
1 5 10 15 

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile 
65 70 75 80 

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly 
85 90 95 

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys 
100 105 110 

Leu Lys Leu 
115 

<210> 9 
<211> 616 
<212> DNA 

<213> Neurospora crassa 
<220> 

<221> unsure 
<222> 1..616 

<223> n= a or g or c or t/u 
<400> 9 

ggtggcgaag acganggcgg aagttggagg ctaacgagaa tgacnctcgg agatggatct 60 
accctctaga gacacgacta ccnttgcacc cagcctcaag gtntacngtt tntatgggta 120 
ggaagccgac ggagcgagcc tacatctatc tggcgcccga tcccgggacg acaacgcatc 180 
tttagatgac gatcgatacg actttgactn aggggcacat tgaccacggt gtgattttgg 240 
gcgaaggcga tggcacagtg aaccttatga gtttggggta cctgtgcaat aaggggtgga 300 
aaatgaagag 


atacaatcct 


gcgggctcaa 


aaataaccgt 


ggtcgagatg 


ccgcatgaac 


360 


cagaacggtt 


caatccgaga 


ggagggccga 


atacggcgga 


tcacgtggat 


attctaggaa 


420 


ggcagaatct 


aaacgagtac 


attcttaaag 


tggcggcagg 


tcgaggcgat 


acaattgagg 


480 


attttattac 


tagtaatatt 


cttaaatatg 


tagaaaaggt 


tgaaatttat 


gaagagtaat 


540 


taaatacggc 


acataggtta 


ctcaatagta 


tgactaatta 


aaaaaaaatt 


ttttttctaa 


600 


aaaaaaaaaa 


aaaaaa 










616 



<210> 10 
<211> 1562 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 10 



atgaaaaaaa 


tatcttcaca 


ttattcggta 


gtcatagcga 


tactcgttgt 


ggtgacgatg 


60 


acctcgatgt 


gtcaagctgt 


gggtagcaac 


gtgtaccctt 


tgattctggt 


tccaggaaac 


120 


ggaggtaacc 


agctagaggt 


acggctggac 


agagaataca 


agccaagtag 


tgtctggtgt 


180 


agcagctggt 


tatatccgat 


tcataagaag 


agtggtggat 


ggtttaggct 


atggttcgat 


240 


gcagcagtgt 


tattgtctcc 


cttcaccagg 


tgcttcagcg 


atcgaatgat 


gttgtactat 


300 


gaccctgatt 


tggatgatta 


ccaaaatgct 


cctggtgtcc 


aaacccgggt 


tcctcatttc 


360 


ggttcgacca 


aatcacttct 


atacctcgac 


cctcgtctcc 


ggttagtact 


ttccaagata 


420 


tatcattttg 


ggacatttgc 


ataatgaaca 


aaatagacat 


aaatttgggg 


gattattgtt 


480 


atatcaatat 


ccatttatat 


gctagtcggt 


aatgtgagtg 


ttatgttagt 


atagttaatg 


540 


tgagtgttat 


gtgattttcc 


attttaaatg 


aagctagaaa 


gttgtcgttt 


aataatgttg 


600 


ctatgtcatg 


agaattataa 


ggacactatg 


taaatgtagc 


ttaataataa 


ggtttgattt 


660 


gcagagatgc 


cacatcttac 


atggaacatt 


tggtgaaagc 


tctagagaaa 


aaatgcgggt 


720 


atgttaacga 


ccaaaccatc 


ctaggagctc 


catatgattt 


caggtacggc 


ctggctgctt 


780 


cgggccaccc 


gtcccgtgta 


gcctcacagt 


tcctacaaga 


cctcaaacaa 


ttggtggaaa 


840 


aaactagcag 


cgagaacgaa 


ggaaagccag 


tgatactcct 


ctcccatagc 


ctaggaggac 


900 


ttttcgtcct 


ccatttcctc 


aaccgtacca 


ccccttcatg 


gcgccgcaag 


tacatcaaac 


960 


actttgttgc 


actcgctgcg 


ccatggggtg 


ggacgatctc 


tcagatgaag 


acatttgctt 


1020 


ctggcaacac 


actcggtgtc 


cctttagtta 


accctttgct 


ggtcagacgg 


catcagagga 


1080 


cctccgagag 


taaccaatgg 


ctacttccat 


ctaccaaagt 


gtttcacgac 


agaactaaac 


1140 


cgcttgtcgt 


aactccccag 


gttaactaca 


cagcttacga 


gatggatcgg 


ttttttgcag 


1200 



acattggatt 


ctcacaagga 


gttgtgcctt 


acaagacaag 


agtgttgcct 


ttaacagagg 


1260 


agctgatgac 


tccgggagtg 


ccagtcactt 


gcatatatgg 


gagaggagtt 


gatacaccgg 


1320 


aggttttgat 


gtatggaaaa 


ggaggattcg 


ataagcaacc 


agagattaag 


tatggagatg 


1380 


gagatgggac 


ggttaatttg 


gcgagcttag 


cagctttgaa 


agtcgatagc 


ttgaacaccg 


1440 


tagagattga 


tggagtttcg 


catacatcta 


tacttaaaga 


cgagatcgca 


cttaaagaga 


1500 


ttatgaagca 


gatttcaatt 


attaattatg 


aattagccaa 


tgttaatgcc 


gtcaatgaat 


1560 


ga 












1562 



<210> 11 
<211> 3896 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 



atgggagcga 


attcgaaatc 


agtaacggct 


tccttcaccg 


tcatcgccgt 


ttttttcttg 


60 


atttgcggtg 


gccgaactgc 


ggtggaggat 


gagaccgagt 


ttcacggcga 


ctactcgaag 


120 


ctatcgggta 


taatcattcc 


gggatttgcg 


tcgacgcagc 


tacgagcgtg 


gtcgatcctt 


180 


gactgtccat 


acactccgtt 


ggacttcaat 


ccgctcgacc 


tcgtatggct 


agacaccact 


240 


aaggtccgtg 


atcttcattt 


ccttcgctcc 


ttattctgtc 


ggtcgagtca 


cttgttgatg 


300 


aattccaagc 


gaaatatagc 


aatgaagcat 


gtctcgtctc 


tcttattgat 


tcgttcatta 


360 


gtcaacagtg 


acgcttctga 


atctgagttt 


agagtcatat 


aaaacagctg 


actcggcgag 


420 


tgtttcccat 


cgcttttggt 


tcgctaaatg 


tagcgcaatg 


aatgtgtaat 


tagtctgcgc 


480 


tttttattca 


actagatctg 


caagtttttc 


agagtgctca 


atagtagtta 


gaaaatgtta 


540 


ggtcatttta 


cttgtgcatt 


gtgattcttt 


tggttgttgc 


ttactgatcg 


acgtgatgga 


600 


tggtttacag 


cttctttctg 


ctgtcaactg 


ctggtttaag 


tgtatggtgc 


tagatcctta 


660 


taatcaaaca 


gaccatcccg 


agtgtaagtc 


acggcctgac 


agtggtcttt 


cagccatcac 


720 


agaattggat 


ccaggttaca 


taacaggtag 


tttcggattt 


ttctttcttt 


tgagttttct 


780 


tcaatttgat 


atcatcttgt 


tgtgatataa 


tatggctaag 


ttcattaatt 


tggtcaattt 


840 


tcaggtcctc 


tttctactgt 


ctggaaagag 


tggcttaagt 


ggtgtgttga 


gtttggtata 


900 


gaagcaaatg 


caattgtcgc 


tgttccatac 


gattggagat 


tgtcaccaac 


caaattggaa 


960 


gagcgtgacc 


tttactttca 


caagctcaag 


ttagtcctta 


tcaggctaat 


gtcttttatc 


1020 


ttctcttttt 


atgtaagata 


agctaagagc 


tctggtcgtc 


ttcctttttg 


caggttgacc 


1080 


tttgaaactg 


ctttaaaact 


ccgtggcggc 


ccttctatag 


tatttgccca 


ttcaatgggt 


1140 
aataatgtct 


tcagatactt 


tctggaatgg 


ctgaggctag 


aaattgcacc 


aaaacattat 


1200 


ttgaagtggc 


ttgatcagca 


tatccatgct 


tatttcgctg 


ttggtaccgg 


cctactatcc 


1260 


ttaagttacc 


attttatttt 


ttctctaatt 


gggggagtta 


tgttgtgact 


tactggattg 


1320 


agctcgatac 


ctgatttgtt 


gttgatttag 


gagctcctct 


tcttggttct 


gttgaggcaa 


1380 


tcaaatctac 


tctctctggt 


gtaacgtttg 


gccttcctgt 


ttctgaggtg 


acctctgact 


1440 


tctctttagt 


tttaagtagt 


tgatatcaac 


caggtcttat 


aactcactgg 


attttccttt 


1500 


tgaaagtatt 


acttttgtta 


attgaactgc 


tgtacgcgat 


atggtatctg 


tagatcttga 


1560 


agtgctagtt 


atcaaagaac 


atattgtggg 


tagtatacct 


gtcagcggcc 


ttagctaata 


1620 


caaccaaacc 


acatgtacac 


tgatttagtt 


ttcagattat 


tatggtagac 


tttaagttga 


1680 


gaagaaactt 


tgactgaaat 


ctttttattt 


taataggcta 


tgatttgttt 


attgaaatca 


1740 


tgtgacatat 


tgacatgcgc 


ttctcatgtt 


ttttgttggc 


aaggcttcag 


ggaactgctc 


1800 


ggttgttgtc 


caattctttt 


gcgtcgtcat 


tgtggcttat 


gccattttca 


aagaattgca 


1860 


agggtgataa 


cacattctgg 


acgcattttt 


ctgggggtgc 


tgcaaagaaa 


gataagcgcg 


1920 


tataccactg 


tgatgaagag 


gaatatcaat 


caaaatattc 


tggctggccg 


acaaatatta 


1980 


ttaacattga 


aattccttcc 


actagcggtt 


agactctgta 


tatgcaactg 


taacactaac 


2040 


aaaagtttca 


ccaagaatgt 


tcactctcat 


atttcgttcc 


tttgatgtgt 


atccatcagt 


2100 


tacagaaaca 


gctctagtca 


acatgaccag 


catggaatgt 


ggccttccca 


cccttttgtc 


2160 


tttcacagcc 


cgtgaactag 


cagatgggac 


tcttttcaaa 


gcaatagaag 


actatgaccc 


2220 


agatagcaag 


aggatgttac 


accagttaaa 


gaagtacgta 


cctttctttg 


tgataagaaa 


2280 


tattgctcat 


cgatcatcac 


ttgctggctt 


cttgtacgtc 


aaattgtttt 


gtttaaatct 


2340 


ctatatcaat 


tgttcatatg 


ctttgtcttt 


cttactataa 


gaaacaagta 


taatcagaaa 


2400 


ccttattatt 


gattatcagt 


tctctcctta 


tattatggaa 


tgtctttttc 


gtttacagtt 


2460 


atgaatgcaa 


aagggggtat 


tttagttgat 


tgattctctc 


attctctagt 


ttgttttgac 


2520 


taatagcgtc 


aattttgttt 


ttctagcaaa 


tctttgtgaa 


ttatatataa 


catgctaact 


2580 


atacttttca 


ggttgtatca 


tgatgaccct 


gtttttaatc 


ctctgactcc 


ttgggagaga 


2640 


ccacctataa 


aaaatgtatt 


ttgcatatat 


ggtgctcatc 


taaagacaga 


ggtatgatgc 


2700 


attctcaata 


tcacattatg 


cgttgacttt 


gttattatat 


tccccatttg 


gtttgcaata 


2760 


tctttttgaa 


ttatgattta 


tcttctccct 


tgcatcttat 


gctattaagc 


gttaaaggta 


2820 


ctaaatgtat 


gaagctgtct 


gtcataggtt 


ggttattact 


ttgccccaag 


tggcaaacct 


2880 


tatcctgata 


attggatcat 


cacggatatc 


atttatgaaa 


ctgaaggttc 


cctcgtgtca 


2940 



aggtaatttt 


ccgcaatggc 


agaagtaaaa 


gtggcatgtt 


atctcagttg 


cataagcaaa 


ttatcattcc 


ttttgagctt 


agtggatgat 


tgaaacatga 


cacttgtatc 


aaagataact 


tcatattatt 


aggagtagtc 


gtgcttttaa 


ttcatatctt 


gattgtgcaa 


tatctgcagg 


cctataactg 


gggatgagac 


ggtaagctca 


actactgaag 


actaagataa 


tacttgcttc 


actgcaatat 


tgactctccg 


ctacttttat 


ctatcattca 


ctctcttggt 


gcaagaattg 


tccccaggta 


ctctttttta 


gttcctcacc 


ctggttatgt 


gttgatttac 


ctccaatttg 


cctcaagaac 


ttgtattaat 


ctaaacgaga 


aacacgatgg 


aagcgacgta 


catgtggaac 


tcatagctaa 


catgacaaaa 


gcaccaaggg 


agagcattcc 


ggggaagaga 


accgcagtct 



<210> 12 
<211> 709 
<212> DNA 

<213> Lycopersicon esculentum 
<400> 12 

ctggggccaa aagtgaacat aacaaggaca 
aagtgcatct aaatatagag catcaacatg 
tacctacaat gaagtacata acctattatg 
cagcagtttg ggagcttgat aaagcaaatc 
tgcgggagct gtggcttgag atgtggcatg 
ttacaaaagg tggtgtctga tcctcactat 
acattgtaag tattgcaaca aaaagcaaag 
tgggattacg ggaaagctcg atgtgcatgg 
tcaaattata ttttgcaaaa tattctcttt 






caggaaggca 


aagtcttctg 


tatcagtcta 


3000 


ttattaaaca 


actaaaattt 


aagtactttt 


3060 


cagtggctta 


aagtgggaag 


aggtgttgca 


3120 


agcaaaacaa 


aactaaccca 


tttctgaatt 


3180 


aaaatttgtt 


ttaagaaacc 


gaaaaactag 


3240 


tctggaactg 


tggttgatgg 


gaacgctgga 


3300 


gaagttggtt 


ttgaaattat 


cttcttgcaa 


3360 


tggaacactg 


cttgctatgt 


tctctagtac 


3420 


tgattatgaa 


attgatctct 


tataggtacc 


3480 


gctcggacct 


aaagttaaca 


taacaatggc 


3540 


ttatatagat 


caaactttaa 


gtgtactttt 


3600 


ttctttctaa 


aaatcatata 


tctctgtact 


3660 


ttctcattgg 


gaaaataaaa 


caacagccag 


3720 


taaatgttga 


tcatgagcat 


gggtcagaca 


3780 


ttaagtacat 


aaccttttat 


gaagactctg 


3840 


gggagcttga 


taaaagtggg 


tattaa 


3896 


ccacagtcag 


agcatgatgt 


tcagatgtac 


60 


gtgaagatat 


cattcccaat 


atgacaaagt 


120 


aggattctga 


aagttttcca 


gggacaagaa 


180 


acaggaacat 


tgtcagatct 


ccagctttga 


240 


atattcatcc 


tgataaaaag 


tccaagtttg 


300 


tttcttctat 


aaatgtttga 


gtttgtattg 


360 


cgtgggcctc 


tgagggatga 


ggactgctat 


420 


gctgaacatt 


gtgaatacag 


gttagaatat 


480 


ttgtgtattt 


aggccacctt 


tccccggtca 


540 



caacgatgca 
tctacatctc 
aacaagtttg 



gatatgtatt 
acatcctgtc 
cacaaacatt 



cggggatgtt 
acactatgtg 
tgaagaagaa 
cacctgggac 
tgatatttaa 
agcgaaatga 



agagttgcag 
gaaactttgt 
ttcagagag 



attgaagagt 
ttggcggaac 

<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 13 

Met Ala Ser Ser Lys Lys Ser Lys Thr His Lys Lys Lys Lys Glu Val 
1 5 10 15 

Lys Ser Pro Ile Asp Leu Pro Asn Ser Lys Lys Pro Thr Arg Ala Leu 
20 25 30 

Ser Glu Gln Pro Ser Ala Ser Glu Thr Gln Ser Val Ser Asn Lys Ser 
35 40 45 

Arg Lys Ser Lys Phe Gly Lys Arg Leu Asn Phe Ile Leu Gly Ala Ile 
50 55 60 

Leu Gly Ile Cys Gly Ala Phe Phe Phe Ala Val Gly Asp Asp Asn Ala 
65 70 75 80 

Val Phe Asp Pro Ala Thr Leu Asp Lys Phe Gly Asn Met Leu Gly Ser 
85 90 95 

Ser Asp Leu Phe Asp Asp Ile Lys Gly Tyr Leu Ser Tyr Asn Val Phe 
100 105 110 

Lys Asp Ala Pro Phe Thr Thr Asp Lys Pro Ser Gln Ser Pro Ser Gly 
115 120 125 

Asn Glu Val Gln Val Gly Leu Asp Met Tyr Asn Glu Gly Tyr Arg Ser 
130 135 140 

Asp His Pro Val Ile Met Val Pro Gly Val Ile Ser Ser Gly Leu Glu 
145 150 155 160 

Ser Trp Ser Phe Asn Asn Cys Ser Ile Pro Tyr Phe Arg Lys Arg Leu 
165 170 175 

Trp Gly Ser Trp Ser Met Leu Lys Ala Met Phe Leu Asp Lys Gln Cys 
180 185 190 

Trp Leu Glu His Leu Met Leu Asp Lys Lys Thr Gly Leu Asp Pro Lys 
195 200 205 

Gly Ile Lys Leu Arg Ala Ala Gln Gly Phe Glu Ala Ala Asp Phe Phe 
210 215 220 

Ile Thr Gly Tyr Trp Ile Trp Ser Lys Val Ile Glu Asn Leu Ala Ala 
225 230 235 240 

Ile Gly Tyr Glu Pro Asn Asn Met Leu Ser Ala Ser Tyr Asp Trp Arg 
245 250 255 

Leu Ser Tyr Ala Asn Leu Glu Glu Arg Asp Lys Tyr Phe Ser Lys Leu 
260 265 270 



Lys Met Phe Ile Glu Tyr Ser Asn Ile Val His Lys Lys Lys Val Val 
275 280 285 

Leu Ile Ser His Ser Met Gly Ser Gln Val Thr Tyr Tyr Phe Phe Lys 
290 295 300 

Trp Val Glu Ala Glu Gly Tyr Gly Asn Gly Gly Pro Thr Trp Val Asn 
305 310 315 320 

Asp His Ile Glu Ala Phe Ile Asn Ile Ser Gly Ser Leu Ile Gly Ala 
325 330 335 

Pro Lys Thr Val Ala Ala Leu Leu Ser Gly Glu Met Lys Asp Thr Gly 
340 345 350 

Ile Val Ile Thr Leu Asn Ile Leu Glu Lys Phe Phe Ser Arg Ser Glu 
355 360 365 

Arg Ala Met Met Val Arg Thr Met Gly Gly Val Ser Ser Met Leu Pro 
370 375 380 

Lys Gly Gly Asp Val Ala Pro Asp Asp Leu Asn Gln Thr Asn Phe Ser 
385 390 395 400 

Asn Gly Ala Ile Ile Arg Tyr Arg Glu Asp Ile Asp Lys Asp His Asp 
405 410 415 

Glu Phe Asp Ile Asp Asp Ala Leu Gln Phe Leu Lys Asn Val Thr Asp 
420 425 430 

Asp Asp Phe Lys Val Met Leu Ala Lys Asn Tyr Ser His Gly Leu Ala 
435 440 445 

Trp Thr Glu Lys Glu Val Leu Lys Asn Asn Glu Met Pro Ser Lys Trp 
450 455 460 

Ile Asn Pro Leu Glu Thr Ser Leu Pro Tyr Ala Pro Asp Met Lys Ile 
465 470 475 480 

Tyr Cys Val His Gly Val Gly Lys Pro Thr Glu Arg Gly Tyr Tyr Tyr 
485 490 495 

Thr Asn Asn Pro Glu Gly Gln Pro Val Ile Asp Ser Ser Val Asn Asp 
500 505 510 

Gly Thr Lys Val Glu Asn Gly Ile Val Met Asp Asp Gly Asp Gly Thr 
515 520 525 

Leu Pro Ile Leu Ala Leu Gly Leu Val Cys Asn Lys Val Trp Gln Thr 
530 535 540 

Lys Arg Phe Asn Pro Ala Asn Thr Ser Ile Thr Asn Tyr Glu Ile Lys 
545 550 555 560 

His Glu Pro Ala Ala Phe Asp Leu Arg Gly Gly Pro Arg Ser Ala Glu 
565 570 575 



His Val Asp Ile Leu Gly His Ser Glu Leu Asn Glu Ile Ile Leu Lys 
580 585 590 

Val Ser Ser Gly His Gly Asp Ser Val Pro Asn Arg Tyr Ile Ser Asp 
595 600 605 

Ile Gln Glu Ile Ile Asn Glu Ile Asn Leu Asp Lys Pro Arg Asn 
610 615 620 



<210> 14 
<211> 432 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

Met Lys Lys Ile Ser Ser His Tyr Ser Val Val Ile Ala Ile Leu Val 
1 5 10 15 

Val Val Thr Met Thr Ser Met Cys Gln Ala Val Gly Ser Asn Val Tyr 
20 25 30 

Pro Leu Ile Leu Val Pro Gly Asn Gly Gly Asn Gln Leu Glu Val Arg 
35 40 45 

Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val Trp Cys Ser Ser Trp Leu 
50 55 60 

Tyr Pro Ile His Lys Lys Ser Gly Gly Trp Phe Arg Leu Trp Phe Asp 
65 70 75 80 

Ala Ala Val Leu Leu Ser Pro Phe Thr Arg Cys Phe Ser Asp Arg Met 
85 90 95 

Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp Tyr Gln Asn Ala Pro Gly 
100 105 110 

Val Gln Thr Arg Val Pro His Phe Gly Ser Thr Lys Ser Leu Leu Tyr 
115 120 125 

Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser Tyr Met Glu His Leu Val 
130 135 140 

Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val Asn Asp Gln Thr Ile Leu 
145 150 155 160 

Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu Ala Ala Ser Gly His Pro 
165 170 175 

Ser Arg Val Ala Ser Gln Phe Leu Gln Asp Leu Lys Gln Leu Val Glu 
180 185 190 

Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro Val Ile Leu Leu Ser His 
195 200 205 

Ser Leu Gly Gly Leu Phe Val Leu His Phe Leu Asn Arg Thr Thr Pro 
210 215 220 

Ser Trp Arg Arg Lys Tyr Ile Lys His Phe Val Ala Leu Ala Ala Pro 
225 230 235 240 

Trp Gly Gly Thr Ile Ser Gln Met Lys Thr Phe Ala Ser Gly Asn Thr 
245 250 255 

Leu Gly Val Pro Leu Val Asn Pro Leu Leu Val Arg Arg His Gln Arg 
260 265 270 

Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro Ser Thr Lys Val Phe His 
275 280 285 

Asp Arg Thr Lys Pro Leu Val Val Thr Pro Gln Val Asn Tyr Thr Ala 
290 295 300 

Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile Gly Phe Ser Gln Gly Val 
305 310 315 320 

Val Pro Tyr Lys Thr Arg Val Leu Pro Leu Thr Glu Glu Leu Met Thr 
325 330 335 

Pro Gly Val Pro Val Thr Cys Ile Tyr Gly Arg Gly Val Asp Thr Pro 
340 345 350 

Glu Val Leu Met Tyr Gly Lys Gly Gly Phe Asp Lys Gln Pro Glu Ile 
355 360 365 

Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn Leu Ala Ser Leu Ala Ala 
370 375 380 

Leu Lys Val Asp Ser Leu Asn Thr Val Glu Ile Asp Gly Val Ser His 
385 390 395 400 

Thr Ser Ile Leu Lys Asp Glu Ile Ala Leu Lys Glu Ile Met Lys Gln 
405 410 415 

Ile Ser Ile Ile Asn Tyr Glu Leu Ala Asn Val Asn Ala Val Asn Glu 
420 425 430 



<210> 15 
<211> 552 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 15 

Met Gly Ala Asn Ser Lys Ser Val Thr Ala Ser Phe Thr Val Ile Ala 
1 5 10 15 

Val Phe Phe Leu Ile Cys Gly Gly Arg Thr Ala Val Glu Asp Glu Thr 
20 25 30 

Glu Phe His Gly Asp Tyr Ser Lys Leu Ser Gly Ile Ile Ile Pro Gly 
35 40 45 

Phe Ala Ser Thr Gln Leu Arg Ala Trp Ser Ile Leu Asp Cys Pro Tyr 
50 55 60 



Thr Pro Leu Asp Phe Asn Pro Leu Asp Leu Val Trp Leu Asp Thr Thr 
65 70 75 80 
Lys Leu Leu Ser Ala Val Asn Cys Trp Phe Lys Cys Met Val Leu Asp
85 90 95

Pro Tyr Asn Gln Thr Asp His Pro Glu Cys Lys Ser Arg Pro Asp Ser
100 105 110

Gly Leu Ser Ala Ile Thr Glu Leu Asp Pro Gly Tyr Ile Thr Gly Pro
115 120 125

Leu Ser Thr Val Trp Lys Glu Trp Leu Lys Trp Cys Val Glu Phe Gly
130 135 140

Ile Glu Ala Asn Ala Ile Val Ala Val Pro Tyr Asp Trp Arg Leu Ser
145 150 155 160

Pro Thr Lys Leu Glu Glu Arg Asp Leu Tyr Phe His Lys Leu Lys Leu
165 170 175

Thr Phe Glu Thr Ala Leu Lys Leu Arg Gly Gly Pro Ser Ile Val Phe
180 185 190

Ala His Ser Met Gly Asn Asn Val Phe Arg Tyr Phe Leu Glu Trp Leu
195 200 205

Arg Leu Glu Ile Ala Pro Lys His Tyr Leu Lys Trp Leu Asp Gln His
210 215 220

Ile His Ala Tyr Phe Ala Val Gly Ala Pro Leu Leu Gly Ser Val Glu
225 230 235 240

Ala Ile Lys Ser Thr Leu Ser Gly Val Thr Phe Gly Leu Pro Val Ser
245 250 255

Glu Gly Thr Ala Arg Leu Leu Ser Asn Ser Phe Ala Ser Ser Leu Trp
260 265 270

Leu Met Pro Phe Ser Lys Asn Cys Lys Gly Asp Asn Thr Phe Trp Thr
275 280 285

His Phe Ser Gly Gly Ala Ala Lys Lys Asp Lys Arg Val Tyr His Cys
290 295 300

Asp Glu Glu Glu Tyr Gln Ser Lys Tyr Ser Gly Trp Pro Thr Asn Ile
305 310 315 320

Ile Asn Ile Glu Ile Pro Ser Thr Ser Ala Arg Glu Leu Ala Asp Gly
325 330 335

Thr Leu Phe Lys Ala Ile Glu Asp Tyr Asp Pro Asp Ser Lys Arg Met
340 345 350

Leu His Gln Leu Lys Lys Tyr Val Pro Phe Phe Val Ile Arg Asn Ile
355 360 365

Ala His Arg Ser Ser Leu Ala Gly Phe Leu Leu Tyr His Asp Asp Pro
370 375 380

Val Phe Asn Pro Leu Thr Pro Trp Glu Arg Pro Pro Ile Lys Asn Val
385 390 395 400
Phe Cys Ile Tyr Gly Ala His Leu Lys Thr Glu Val Gly Tyr Tyr Phe
405 410 415

Ala Pro Ser Gly Lys Pro Tyr Pro Asp Asn Trp Ile Ile Thr Asp Ile
420 425 430

Ile Tyr Glu Thr Glu Gly Ser Leu Val Ser Arg Ser Gly Thr Val Val
435 440 445

Asp Gly Asn Ala Gly Pro Ile Thr Gly Asp Glu Thr Val Pro Tyr His
450 455 460

Ser Leu Ser Trp Cys Lys Asn Trp Leu Gly Pro Lys Val Asn Ile Thr
465 470 475 480

Met Ala Pro Gln Ile Leu Ile Gly Lys Ile Lys Gln Gln Pro Glu His
485 490 495

Asp Gly Ser Asp Val His Val Glu Leu Asn Val Asp His Glu His Gly
500 505 510

Ser Asp Ile Ile Ala Asn Met Thr Lys Ala Pro Arg Val Lys Tyr Ile
515 520 525

Thr Phe Tyr Glu Asp Ser Glu Ser Ile Pro Gly Lys Arg Thr Ala Val
530 535 540

Trp Glu Leu Asp Lys Ser Gly Tyr
545 550



<210> 16 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 16 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660



<210> 17 
<211> 387 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 17 

Val Gly Ser Asn Val Tyr Pro Leu Ile Leu Val Pro Gly Asn Gly Gly
1 5 10 15

Asn Gln Leu Glu Val Arg Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val
20 25 30

Trp Cys Ser Ser Trp Leu Tyr Pro Ile His Lys Lys Ser Gly Gly Trp
35 40 45

Phe Arg Leu Trp Phe Asp Ala Ala Val Leu Leu Ser Pro Phe Thr Arg
50 55 60

Cys Phe Ser Asp Arg Met Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp
65 70 75 80

Tyr Gln Asn Ala Pro Gly Val Gln Thr Arg Val Pro His Phe Gly Ser
85 90 95

Thr Lys Ser Leu Leu Tyr Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser
100 105 110

Tyr Met Glu His Leu Val Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val
115 120 125

Asn Asp Gln Thr Ile Leu Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu
130 135 140

Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gln Phe Leu Gln Asp
145 150 155 160

Leu Lys Gln Leu Val Glu Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro
165 170 175

Val Ile Leu Leu Ser His Ser Leu Gly Gly Leu Phe Val Leu His Phe
180 185 190

Leu Asn Arg Thr Thr Pro Ser Trp Arg Arg Lys Tyr Ile Lys His Phe
195 200 205

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr Ile Ser Gln Met Lys Thr
210 215 220

Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Val Asn Pro Leu Leu
225 230 235 240

Val Arg Arg His Gln Arg Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro
245 250 255

Ser Thr Lys Val Phe His Asp Arg Thr Lys Pro Leu Val Val Thr Pro
260 265 270

Gln Val Asn Tyr Thr Ala Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile
275 280 285

Gly Phe Ser Gln Gly Val Val Pro Tyr Lys Thr Arg Val Leu Pro Leu
290 295 300

Thr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys Ile Tyr Gly
305 310 315 320

Arg Gly Val Asp Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe
325 330 335

Asp Lys Gln Pro Glu Ile Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn
340 345 350

Leu Ala Ser Leu Ala Ala Leu Lys Val Asp Ser Leu Asn Thr Val Glu
355 360 365

Ile Asp Gly Val Ser His Thr Ser Ile Leu Lys Asp Glu Ile Ala Leu
370 375 380

Lys Glu Ile
385



<210> 18 
<211> 389 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Leu Lys Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe Ile Pro
1 5 10 15

Gly Ile Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gln Cys Ala
20 25 30

Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Leu Cys Trp
35 40 45

Val Glu His Met Ser Leu Asp Asn Glu Thr Gly Leu Asp Pro Ala Gly
50 55 60

Ile Arg Val Arg Ala Val Ser Gly Leu Val Ala Ala Asp Tyr Phe Ala
65 70 75 80

Pro Gly Tyr Phe Val Trp Ala Val Leu Ile Ala Asn Leu Ala His Ile
85 90 95

Gly Tyr Glu Glu Lys Asn Met Tyr Met Ala Ala Tyr Asp Trp Arg Leu
100 105 110

Ser Phe Gln Asn Thr Glu Arg Asp Gln Thr Leu Ser Arg Met Lys Ser
115 120 125

Asn Ile Glu Leu Met Val Ser Thr Asn Gly Gly Lys Lys Ala Val Ile
130 135 140

Val Pro His Ser Met Gly Val Leu Tyr Phe Leu His Phe Met Lys Trp
145 150 155 160

Val Glu Ala Pro Ala Pro Leu Gly Gly Gly Gly Gly Pro Asp Trp Cys
165 170 175

Ala Lys Tyr Ile Lys Ala Val Met Asn Ile Gly Gly Pro Phe Leu Gly
180 185 190

Val Pro Lys Ala Val Ala Gly Leu Phe Ser Ala Glu Ala Lys Asp Met
195 200 205
20 25 30

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tcc aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt agc ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc gct tat cat gtt cat aat agc gat agc gac ttg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tca ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

ctc cca caa ggt ata agt tcg ttt att gat gat att cag gct ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

tac tcc aca tct tct tta gat gat ctc agt gaa aat ttt gcc gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa ctc tta cgt gat tat aat atc gag gcc aaa cat cct gtt gta 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

atg gtt cct ggt gtc att tct acg gga att gaa agc tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa cgg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg cta cgt gca gca cag ggc ttc gaa tca act gat tat ttc atc 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gct gcg tat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta gat cta gaa aga cgc gat agg tac ttt acg aag cta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa atc gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga cat tct atg ggt tct cag att atc ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gag gct gaa ggc cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tca ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gct cca aag gca gtt cca gct cta att agt ggt gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

att caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg cta cca aag gga gaa gag gtc att tgg ggg gat atg aag tca tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

tca gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg agc gat gct ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gcc att aac atg aca tta tcg ata tca cct gaa tgg ctc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac tcg ttc ggc tat tcc aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495
gag tta aga aaa aat gag cta cac cac aag cac tgg tcg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa gct ccc cac atg aaa atc tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tcc tct gct ctg aat ttg acc atc gac tac gaa agc aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc ctc acc gag ggg gac gga acc gtt ccg ctc gtg gcg cat tca 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tgt cac aaa tgg gcc cag ggt gct tca ccg tac aac cct gcc gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gga gca aaa agc gcc gaa cac gta gac atc ctc ggc agc 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac atc ttg aaa att gca agc ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

ctc gtc gag cca cgc caa ttg tct aat ttg agc cag tgg gtt tct cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

atg ccc ttc cca atg taa 1986
Met Pro Phe Pro Met
660



<210> 20 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 20 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60



Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445



Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660



<210> 21 
<211> 1986 






atg gtt cct ggt gtc att tct acg gga att gaa agc tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa cgg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg cta cgt gca gca cag ggc ttc gaa tca act gat tat ttc atc 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gct gcg tat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta gat cta gaa aga cgc gat agg tac ttt acg aag cta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa atc gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga cat tct atg ggt tct cag att atc ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gag gct gaa ggc cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tca ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gct cca aag gca gtt cca gct cta att agt ggt gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

att caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg cta cca aag gga gaa gag gtc att tgg ggg gat atg aag tca tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430
tca gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg agc gat gct ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gcc att aac atg aca tta tcg ata tca cct gaa tgg ctc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac tcg ttc ggc tat tcc aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag cta cac cac aag cac tgg tcg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa gct ccc cac atg aaa atc tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tcc tct gct ctg aat ttg acc atc gac tac gaa agc aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc ctc acc gag ggg gac gga acc gtt ccg ctc gtg gcg cat tca 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tgt cac aaa tgg gcc cag ggt gct tca ccg tac aac cct gcc gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gga gca aaa agc gcc gaa cac gta gac atc ctc ggc agc 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac atc ttg aaa att gca agc ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

ctc gtc gag cca cgc caa ttg tct aat ttg agc cag tgg gtt tct cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

atg ccc ttc cca atg taa 1986
Met Pro Phe Pro Met
660
<210> 22
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 22 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660

<210> 23 
<211> 2312 
<212> DNA 

<213> Schizosaccharomyces pombe 
<400> 23 

atggcgtctt ccaagaagag caaaactcat aagaaaaaga aagaagtcaa atctcctatc 60 
gacttaccaa attcaaagaa accaactcgc gctttgagtg agcaaccttc agcgtccgaa 120 
acacaatctg tttcaaataa atcaagaaaa tctaaatttg gaaaaagatt gaattttata 180 
ttgggcgcta ttttgggaat atgcggtgct ttttttttcg ctgttggaga cgacaatgct 240 
gttttcgacc ctgctacgtt agataaattt gggaatatgc taggctcttc agacttgttt 300 
gatgacatta aaggatattt atcttataat gtgtttaagg atgcaccttt tactacggac 360 
aagccttcgc agtctcctag cggaaatgaa gttcaagttg gtcttgatat gtacaatgag 420 
ggatatcgaa gtgaccatcc tgttattatg gttcctggtg ttatcagctc aggattagaa 480 
agttggtcgt ttaataattg ctcgattcct tactttagga aacgtctttg gggtagctgg 540 
tctatgctga aggcaatgtt ccttgacaag caatgctggc ttgaacattt aatgcttgat 600 
aaaaaaaccg gcttggatcc gaagggaatt aagctgcgag cagctcaggg gtttgaagca 660 
gctgattttt ttatcacggg ctattggatt tggagtaaag taattgaaaa ccttgctgca 720 
attggttatg agcctaataa catgttaagt gcttcttacg attggcggtt atcatatgca 780 
aatttagagg aacgtgataa atatttttca aagttaaaaa tgttcattga gtacagcaac 840 
attgtacata agaaaaaggt agtgttgatt tctcactcca tgggttcaca ggttacgtac 900 
tattttttta agtgggttga agctgagggc tacggaaatg gtggaccgac ttgggttaat 960
gatcatattg aagcatttat aaatgtgagt ctcgatggtt gtttgactac gtttctaact 1020 



tttgaataga tatcgggatc tttgattgga gcacccaaaa cagtggcagc gcttttatcg 1080
ggtgaaatga aagatacagg tattgtaatt acattaaaca tgttaatatt taatttttgc 1140
taaccgtttt aagctcaatt gaatcagttt tcggtctatg ggtaagcaat aaattgttga 1200
gatttgttac taatttactg tttagtttgg aaaaattttt ttcccgttct gaggtatatt 1260
caaaaataca aatgtgctct actttttcta acttttaata gagagccatg atggttcgca 1320
ctatgggagg agttagttct atgcttccta aaggaggcga tgttgtatgg ggaaatgcca 1380
gttgggtaag aaatatgtgc tgttaatttt ttattaatat ttaggctcca gatgatctta 1440
atcaaacaaa tttttccaat ggtgcaatta ttcgatatag agaagacatt gataaggacc 1500
acgatgaatt tgacatagat gatgcattac aatttttaaa aaatgttaca gatgacgatt 1560
ttaaagtcat gctagcgaaa aattattccc acggtcttgc ttggactgaa aaagaagtgt 1620
taaaaaataa cgaaatgccg tctaaatgga taaatccgct agaagtaaga acattaaagt 1680
tactaaatta tactaaccca aatagactag tcttccttat gctcctgata tgaaaattta 1740
ttgcgttcac ggggtcggaa aaccaactga gagaggttat tattatacta ataatcctga 1800
ggggcaacct gtcattgatt cctcggttaa tgatggaaca aaagttgaaa atgtgagaga 1860
atttatgttt caaacattct attaactgtt ttattagggt attgttatgg atgatggtga 1920
tggaacttta ccaatattag cccttggttt ggtgtgcaat aaagtttggc aaacaaaaag 1980
gtttaatcct gctaatacaa gtatcacaaa ttatgaaatc aagcatgaac ctgctgcgtt 2040
tgatctgaga ggaggacctc gctcggcaga acacgtcgat atacttggac attcagagct 2100
aaatgtatgt tcattttacc ttacaaattt ctattactaa ctcttgaaat aaggaaatta 2160
ttttaaaagt ttcatcaggc catggtgact cggtaccaaa ccgttatata tcagatatcc 2220
agtacggaca taagttttgt agattgcaat taactaacta accgaacagg gaaataataa 2280
atgagataaa tctcgataaa cctagaaatt aa 2312



<210> 24 
<211> 3685 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 24 

atgcccctta ttcatcggaa aaagccgacg gagaaaccat cgacgccgcc atctgaagag 60 
gtggtgcacg atgaggattc gcaaaagaaa ccacacgaat cttccaaatc ccaccataag 120 
aaatcgaacg gaggagggaa gtggtcgtgc atcgattctt gttgttggtt cattgggtgt 180 
gtgtgtgtaa cctggtggtt tcttctcttc ctttacaacg caatgcctgc gagcttccct 240 
cagtatgtaa cggagcgaat cacgggtcct ttgcctgacc cgcccggtgt taagctcaaa 300
aaagaaggtc ttaaggcgaa acatcctgtt gtcttcattc ctgggattgt caccggtggg 360
ctcgagcttt gggaaggcaa acaatgcgct gatggtttat ttagaaaacg tttgtggggt 420
ggaacttttg gtgaagtcta caaaaggtga gctcaacaat tctcactctt cctttatatt 480
gggatttgga ttggatctga tgagatcacg cacttgttgc ttcttcaaca tcactcaaac 540
tttaattcca tgtttgtctg tcttactctt tacttttttt tttttttgat gtgaaacgct 600
attttcttaa gagactattt ctgtatgtgt aaggtaagcg ttccaaggac gtaattggct 660
tggactattt ctgtttgatt gttaacttta ggatataaaa tagctgcctt ggaatttcaa 720
gtcatcttat tgccaaatct gttgctagac atgccctaga gtccgttcat aacaagttac 780
ttcctttact gtcgttgcgt gtagatttag ctttgtgtag cgtataatga agtagtgttt 840
tatgttttgt tgggaataga gaagttctaa ctacatctgt ggaaagtgtg ttcaggctgt 900
gatagaggac tgttgcttta ttattcaact atgtatatgt gtaattaaag ctagttcctt 960
tttgatcttt cagctcaatg tgcttttctc aatttttttc tcaatttcaa agtttcacat 1020
cgagtttatt cacatgtctt gaatttcgtc catcctcgtt ctgttatcca gctttgaact 1080
cctcccgacc ctgctatgga tatattaaaa aaaaagtgtt ttgtgggttg catctttgtt 1140
acgatctgca tcttcttctt tcggctcagt gttcatgttt ttgctatggt agagatgggc 1200
aatgttattg ttgatggtaa cagtggtata gttgatagta tcttaactaa tcaattatct 1260
ctttgattca ggcctctatg ttgggtggaa cacatgtcac ttgacaatga aactgggttg 1320
gatccagctg gtattagagt tcgagctgta tcaggactcg tggctgctga ctactttgct 1380
cctggctact ttgtctgggc agtgctgatt gctaaccttg cacatattgg atatgaagag 1440
aaaaatatgt acatggctgc atatgactgg cggctttcgt ttcagaacac agaggttctt 1500
ttctcatcgt tctttctatt attctgttcc atgttacgtt tctttcttca ttacttaagg 1560
cttaaatatg tttcatgttg aattaatagg tacgtgatca gactcttagc cgtatgaaaa 1620
gtaatataga gttgatggtt tctaccaacg gtggaaaaaa agcagttata gttccgcatt 1680
ccatgggggt cttgtatttt ctacatttta tgaagtgggt tgaggcacca gctcctctgg 1740
gtggcggggg tgggccagat tggtgtgcaa agtatattaa ggcggtgatg aacattggtg 1800
gaccatttct tggtgttcca aaagctgttg cagggctttt ctctgctgaa gcaaaggatg 1860
ttgcagttgc caggtattga atatctgctt atacttttga tgatcagaac cttggctctg 1920
gaactcaaag ttattctact aaatatcaat tctaataaca ttgctatatt atcgctgcaa 1980
ctgacattgg ttgattattt ttgctgctta tgtaactgaa actctcttga gattagacaa 2040
atgatgaatt gataattctt acgcattgct ctgtgatgac cagtttctta gcttcgacga 2100
taacatttgt catactgtct tttggagggc attgaatttt gctatggaaa gcgctggagc 2160
ttccatgctt gcattcttta ccaattagcg ttattctgct tctttcaatt ttcttgtata 2220
tgcatctatg gtcttttatt tcttcttaat taaagactcg ttggattagt tgctctatta 2280
gtcacttggt tccttaatat agaactttac tttcttcgaa aattgcagag cgattgcccc 2340
aggattctta gacaccgata tatttagact tcagaccttg cagcatgtaa tgagaatgac 2400
acgcacatgg gactcaacaa tgtctatgtt accgaaggga ggtgacacga tatggggcgg 2460
gcttgattgg tcaccggaga aaggccacac ctgttgtggg aaaaagcaaa agaacaacga 2520
aacttgtggt gaagcaggtg aaaacggagt ttccaagaaa agtcctgtta actatggaag 2580
gatgatatct tttgggaaag aagtagcaga ggctgcgcca tctgagatta ataatattga 2640
ttttcgagta aggacatata aatcataata aaccttgtac attttgtgat tgtatgatga 2700
atatctgtac attttatctg gtgaagggtg ctgtcaaagg tcagagtatc ccaaatcaca 2760
cctgtcgtga cgtgtggaca gagtaccatg acatgggaat tgctgggatc aaagctatcg 2820
ctgagtataa ggtctacact gctggtgaag ctatagatct actacattat gttgctccta 2880
agatgatggc gcgtggtgcc gctcatttct cttatggaat tgctgatgat ttggatgaca 2940
ccaagtatca agatcccaaa tactggtcaa atccgttaga gacaaagtaa gtgatttctt 3000
gattccaact gtatccttcg tcctgatgca ttatcagtct ttttgttttc ggtcttgttg 3060
gatatggttt tcagctcaaa gcttacaaag ctgtttctga gcctttctca aaaaggcttg 3120
ctcagtaata ttgaggtgct aaagttgata catgtgactc ttgcttataa atcctccgtt 3180
tggtttgttc tgctttttca gattaccgaa tgctcctgag atggaaatct actcattata 3240
cggagtgggg ataccaacgg aacgagcata cgtatacaag cttaaccagt ctcccgacag 3300
ttgcatcccc tttcagatat tcacttctgc tcacgaggag gacgaagata gctgtctgaa 3360
agcaggagtt tacaatgtgg atggggatga aacagtaccc gtcctaagtg ccgggtacat 3420
gtgtgcaaaa gcgtggcgtg gcaagacaag attcaaccct tccggaatca agacttatat 3480
aagagaatac aatcactctc cgccggctaa cctgttggaa gggcgcggga cgcagagtgg 3540
tgcccatgtt gatatcatgg gaaactttgc tttgatcgaa gatatcatga gggttgccgc 3600
cggaggtaac gggtctgata taggacatga ccaggtccac tctggcatat ttgaatggtc 3660
ggagcgtatt gacctgaagc tgtga 3685



<210> 25 
<211> 402 
<212> DNA

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (120)..(401)

<220> 

<221> unsure 
<222> (1)..(402)

<223> n= a or g or c or t/u
<223> Xaa=unknown

<400> 25 

agaaacagct ctttgtctct ctcgactgat ctaacaatcc ctaatctgtg ttctaaattc 60 

ctggacgaga tttgacaaag tccgtatagc ttaacctggt ttaatttcaa gtgacagat 119 

atg ccc ctt att cat cgg aaa aag ccg acg gag aaa cca tcg acg ccg 167
Met Pro Leu Ile His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro
1 5 10 15

cca tct gaa gag gtg gtg cac gat gag gat tcg caa aag aaa cca cac 215
Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gln Lys Lys Pro His
20 25 30

gaa tct tcc aaa tcc cac cat aag naa tcg aac gga gga ggg aag tgg 263
Glu Ser Ser Lys Ser His His Lys Xaa Ser Asn Gly Gly Gly Lys Trp
35 40 45

tcg tgc atc gat tct tgt tgt tgg ttc att ggg tgt gtg tgt gta acc 311
Ser Cys Ile Asp Ser Cys Cys Trp Phe Ile Gly Cys Val Cys Val Thr
50 55 60

tgg tgg ttt ctt ctc ttc ctt tac aac gca atg cct gcg agc ttc cct 359
Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro
65 70 75 80

cag tat gta acg gag ccg aat cac gng tcc ttt gcc tta ccc g 402
Gln Tyr Val Thr Glu Pro Asn His Xaa Ser Phe Ala Leu Pro
85 90



<210> 26 
<211> 643 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1)..(402)

<220> 

<221> unsure 

<222> (1)..(643)

<223> n= a or g or c or t/u 



<400> 26 



cgg gag aaa ata gct gct ttg aag ggg ggt gtt tac tta gcc gat ggt 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

gat gaa act gtt cca gtt ctt agt gcg ggc tac atg tgt gcg aaa gga 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

tgg cgt ggc aaa act cgt ttc agc cct gcc ggc agc aag act tac gtg 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

aga gaa tac agc cat tcg cca ccc tct act ctc ctg gaa ggc agg ggc 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

acc cag agc ggt gca cat gtt gat ata atg ggg aac ttt gct cta att 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

gag gac gtc atc aga ata gct gct ggg gca acc ggt gag gaa att ggt 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

ggc gat cag gtt tat tca gat ata ttc aag tgg tca gag aaa atc aaa 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

ttg aaa ttg taa cct atg gga agt taa aga agt gcc gac ccg ttt att 384
Leu Lys Leu
115

gcg ttc caa agt gtc ctg cctgagtgca actctggatt ttgcttaaat 432
attgtaattt ttcacgcttc attcgtccct ttgtcaaatt tacatttgac aggacgccaa 492
tgcgatacga tgttgtaccg ctattttcag cattgtatat taaactgtac aggtgtaagt 552
tgcatttgcc agctgaaatt gtgtagtcgt tttctttacg atttaatanc aagtggcgga 612
gcagtgcccc aagcnaaaaa aaaaaaaaaa a 643



<210> 27 

<211> 115 

<212> PRT 

<213> Zea mays 

<400> 27 

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115



<210> 28 
<211> 516 
<212> DNA 

<213> Neurospora crassa 
<220> 

<221> unsure 
<222> (1)..(516)

<223> n= a or g or c or t/u 
<400> 28 

ggtggcgaag acganggcgg aagttggagg ctaacgagaa tgacnctcgg agatggatct 60
accctctaga gacacgacta ccnttgcacc cagcctcaag gtntacngtt tntatgggta 120
ggaagccgac ggagcgagcc tacatctatc tggcgcccga tcccgggacg acaacgcatc 180
tttagatgac gatcgatacg actttgactn aggggcacat tgaccacggt gtgattttgg 240
gcgaaggcga tggcacagtg aaccttatga gtttggggta cctgtgcaat aaggggtgga 300
aaatgaagag atacaatcct gcgggctcaa aaataaccgt ggtcgagatg ccgcatgaac 360
cagaacggtt caatccgaga ggagggccga atacggcgga cttaaatatg tagaaaaggt 420
tgaaatttat gaagagtaat taaatacggc acataggtta ctcaatagta tgactaatta 480
aaaaaaaatt ttttttctaa aaaaaaaaaa aaaaaa 516



<210> 29
<211> 1562
<212> DNA

<213> Arabidopsis thaliana
<400> 29

atgaaaaaaa tatcttcaca ttattcggta gtcatagcga tactcgttgt ggtgacgatg 60
acctcgatgt gtcaagctgt gggtagcaac gtgtaccctt tgattctggt tccaggaaac 120
ggaggtaacc agctagaggt acggctggac agagaataca agccaagtag tgtctggtgt 180
agcagctggt tatatccgat tcataagaag agtggtggat ggtttaggct atggttcgat 240
gcagcagtgt tattgtctcc cttcaccagg tgcttcagcg atcgaatgat gttgtactat 300
gaccctgatt tggatgatta ccaaaatgct cctggtgtcc aaacccgggt tcctcatttc 360
ggttcgacca aatcacttct atacctcgac cctcgtctcc ggttagtact ttccaagata 420
tatcattttg ggacatttgc ataatgaaca aaatagacat aaatttgggg gattattgtt 480
atatcaatat ccatttatat gctagtcggt aatgtgagtg ttatgttagt atagttaatg 540
tgagtgttat gtgattttcc attttaaatg aagctagaaa gttgtcgttt aataatgttg 600
ctatgtcatg agaattataa ggacactatg taaatgtagc ttaataataa ggtttgattt 660
gcagagatgc cacatcttac atggaacatt tggtgaaagc tctagagaaa aaatgcgggt 720
atgttaacga ccaaaccatc ctaggagctc catatgattt caggtacggc ctggctgctt 780
cgggccaccc gtcccgtgta gcctcacagt tcctacaaga cctcaaacaa ttggtggaaa 840
aaactagcag cgagaacgaa ggaaagccag tgatactcct ctcccatagc ctaggaggac 900
ttttcgtcct ccatttcctc aaccgtacca ccccttcatg gcgccgcaag tacatcaaac 960
actttgttgc actcgctgcg ccatggggtg ggacgatctc tcagatgaag acatttgctt 1020
ctggcaacac actcggtgtc cctttagtta accctttgct ggtcagacgg catcagagga 1080
cctccgagag taaccaatgg ctacttccat ctaccaaagt gtttcacgac agaactaaac 1140
cgcttgtcgt aactccccag gttaactaca cagcttacga gatggatcgg ttttttgcag 1200
acattggatt ctcacaagga gttgtgcctt acaagacaag agtgttgcct ttaacagagg 1260
agctgatgac tccgggagtg ccagtcactt gcatatatgg gagaggagtt gatacaccgg 1320
aggttttgat gtatggaaaa ggaggattcg ataagcaacc agagattaag tatggagatg 1380
gagatgggac ggttaatttg gcgagcttag cagctttgaa agtcgatagc ttgaacaccg 1440
tagagattga tggagtttcg catacatcta tacttaaaga cgagatcgca cttaaagaga 1500
ttatgaagca gatttcaatt attaattatg aattagccaa tgttaatgcc gtcaatgaat 1560
1562



<210> 30
<211> 3896
<212> DNA

<213> Arabidopsis thaliana
<400> 30

atgggagcga attcgaaatc agtaacggct tccttcaccg tcatcgccgt ttttttcttg 60
atttgcggtg gccgaactgc ggtggaggat gagaccgagt ttcacggcga ctactcgaag 120
ctatcgggta taatcattcc gggatttgcg tcgacgcagc tacgagcgtg gtcgatcctt 180
gactgtccat acactccgtt ggacttcaat ccgctcgacc tcgtatggct agacaccact 240
aaggtccgtg atcttcattt ccttcgctcc ttattctgtc ggtcgagtca cttgttgatg 300
aattccaagc gaaatatagc aatgaagcat gtctcgtctc tcttattgat tcgttcatta 360
gtcaacagtg acgcttctga atctgagttt agagtcatat aaaacagctg actcggcgag 420
tgtttcccat cgcttttggt tcgctaaatg tagcgcaatg aatgtgtaat tagtctgcgc 480
tttttattca actagatctg caagtttttc agagtgctca atagtagtta gaaaatgtta 540
ggtcatttta cttgtgcatt gtgattcttt tggttgttgc ttactgatcg acgtgatgga 600
tggtttacag cttctttctg ctgtcaactg ctggtttaag tgtatggtgc tagatcctta 660
taatcaaaca gaccatcccg agtgtaagtc acggcctgac agtggtcttt cagccatcac 720
agaattggat ccaggttaca taacaggtag tttcggattt ttctttcttt tgagttttct 780
tcaatttgat atcatcttgt tgtgatataa tatggctaag ttcattaatt tggtcaattt 840
tcaggtcctc tttctactgt ctggaaagag tggcttaagt ggtgtgttga gtttggtata 900
gaagcaaatg caattgtcgc tgttccatac gattggagat tgtcaccaac caaattggaa 960
gagcgtgacc tttactttca caagctcaag ttagtcctta tcaggctaat gtcttttatc 1020
ttctcttttt atgtaagata agctaagagc tctggtcgtc ttcctttttg caggttgacc 1080
tttgaaactg ctttaaaact ccgtggcggc ccttctatag tatttgccca ttcaatgggt 1140
aataatgtct tcagatactt tctggaatgg ctgaggctag aaattgcacc aaaacattat 1200
ttgaagtggc ttgatcagca tatccatgct tatttcgctg ttggtaccgg cctactatcc 1260
ttaagttacc attttatttt ttctctaatt gggggagtta tgttgtgact tactggattg 1320
agctcgatac ctgatttgtt gttgatttag gagctcctct tcttggttct gttgaggcaa 1380
tcaaatctac tctctctggt gtaacgtttg gccttcctgt ttctgaggtg acctctgact 1440
tctctttagt tttaagtagt tgatatcaac caggtcttat aactcactgg attttccttt 1500
tgaaagtatt acttttgtta attgaactgc tgtacgcgat atggtatctg tagatcttga 1560
agtgctagtt atcaaagaac atattgtggg tagtatacct gtcagcggcc ttagctaata 1620
caaccaaacc acatgtacac tgatttagtt ttcagattat tatggtagac tttaagttga 1680
gaagaaactt tgactgaaat ctttttattt taataggcta tgatttgttt attgaaatca 1740
tgtgacatat tgacatgcgc ttctcatgtt ttttgttggc aaggcttcag ggaactgctc 1800
ggttgttgtc caattctttt gcgtcgtcat tgtggcttat gccattttca aagaattgca 1860
agggtgataa cacattctgg acgcattttt ctgggggtgc tgcaaagaaa gataagcgcg 1920
tataccactg tgatgaagag gaatatcaat caaaatattc tggctggccg acaaatatta 1980
ttaacattga aattccttcc actagcggtt agactctgta tatgcaactg taacactaac 2040
aaaagtttca ccaagaatgt tcactctcat atttcgttcc tttgatgtgt atccatcagt 2100
tacagaaaca gctctagtca acatgaccag catggaatgt ggccttccca cccttttgtc 2160
tttcacagcc cgtgaactag cagatgggac tcttttcaaa gcaatagaag actatgaccc 2220
agatagcaag aggatgttac accagttaaa gaagtacgta cctttctttg tgataagaaa 2280
tattgctcat cgatcatcac ttgctggctt cttgtacgtc aaattgtttt gtttaaatct 2340
ctatatcaat tgttcatatg ctttgtcttt cttactataa gaaacaagta taatcagaaa 2400
ccttattatt gattatcagt tctctcctta tattatggaa tgtctttttc gtttacagtt 2460
atgaatgcaa aagggggtat tttagttgat tgattctctc attctctagt ttgttttgac 2520
taatagcgtc aattttgttt ttctagcaaa tctttgtgaa ttatatataa catgctaact 2580
atacttttca ggttgtatca tgatgaccct gtttttaatc ctctgactcc ttgggagaga 2640
ccacctataa aaaatgtatt ttgcatatat ggtgctcatc taaagacaga ggtatgatgc 2700
attctcaata tcacattatg cgttgacttt gttattatat tccccatttg gtttgcaata 2760
tctttttgaa ttatgattta tcttctccct tgcatcttat gctattaagc gttaaaggta 2820
ctaaatgtat gaagctgtct gtcataggtt ggttattact ttgccccaag tggcaaacct 2880
tatcctgata attggatcat cacggatatc atttatgaaa ctgaaggttc cctcgtgtca 2940
aggtaatttt ccgcaatggc agaagtaaaa caggaaggca aagtcttctg tatcagtcta 3000
gtggcatgtt atctcagttg cataagcaaa ttattaaaca actaaaattt aagtactttt 3060
ttatcattcc ttttgagctt agtggatgat cagtggctta aagtgggaag ^ggtgttgca 3120
tgaaacatga cacttgtatc aaagataact agcaaaacaa aactaaccca tttctgaatt 3180
tcatattatt aggagtagtc gtgcttttaa aaaatttgtt ttaagaaacc gaaaaactag 3240
ttcatatctt gattgtgcaa tatctgcagg tctggaactg tggttgatgg gaacgctgga 3300
cctataactg gggatgagac ggtaagctca gaagttggtt ttgaaattat cttcttgcaa 3360
actactgaag actaagataa tacttgcttc tggaacactg cttgctatgt tctctagtac 3420
actgcaatat tgactctccg ctacttttat tgattatgaa attgatctct tataggtacc 3480
ctatcattca ctctcttggt gcaagaattg gctcggacct aaagttaaca taacaatggc 3540
tccccaggta ctctttttta gttcctcacc ttatatagat caaactttaa gtgtactttt 3600
ctggttatgt gttgatttac ctccaatttg ttctttctaa aaatcatata tctctgtact 3660
cctcaagaac ttgtattaat ctaaacgaga ttctcattgg gaaaataaaa caacagccag 3720
aacacgatgg aagcgacgta catgtggaac taaatgttga tcatgagcat gggtcagaca 3780
tcatagctaa catgacaaaa gcaccaaggg ttaagtacat aaccttttat gaagactctg 3840
agagcattcc ggggaagaga accgcagtct gggagcttga taaaagtggg tattaa 3896

<210> 31 
<211> 709 
<212> DNA 
<213> tomato 

<400> 31 



ctggggccaa aagtgaacat aacaaggaca ccacagtcag agcatgatgt tcagatgtac 60
aagtgcatct aaatatagag catcaacatg gtgaagatat cattcccaat atgacaaagt 120
tacctacaat gaagtacata acctattatg aggattctga aagttttcca gggacaagaa 180
cagcagtttg ggagcttgat aaagcaaatc acaggaacat tgtcagatct ccagctttga 240
tgcgggagct gtggcttgag atgtggcatg atattcatcc tgataaaaag tccaagtttg 300
ttacaaaagg tggtgtctga tcctcactat tttcttctat aaatgtttga gtttgtattg 360
acattgtaag tattgcaaca aaaagcaaag cgtgggcctc tgagggatga ggactgctat 420
tgggattacg ggaaagctcg atgtgcatgg gctgaacatt gtgaatacag gttagaatat 480
tcaaattata ttttgcaaaa tattctcttt ttgtgtattt aggccacctt tccccggtca 540
caacgatgca gatatgtatt cggggatgtt cacctgggac agagttgcag attgaagagt 600
tctacatctc acatcctgtc acactatgtg tgatatttaa gaaactttgt ttggcggaac 660
aacaagtttg cacaaacatt tgaagaagaa agcgaaatga ttcagagag 709
BASF-NAE 33 77 / 99 PCT 1/53 12.09.2000

09/937779 

JCfERre'dPCITFrO 2 8 SEP 20(jt 

SEQUENCE LISTING

<110> BASF AG 

<120> A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR
THE PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA
MOLECULES ENCODING THESE ENZYMES

<130> BASF-NAE-3377-99-Sept-2000

<140> PCT/EP 00/02701 
<141> 2000-03-23 

<160> 31 

<170> PatentIn Ver. 2.1

<210> 1 

<211> 1986

<212> genomic DNA

<213> Saccharomyces cerevisiae

<220> 

<221> CDS 

<222> (1)..(1983)

<400> 1 

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 4 8 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 

2 5 10 15 



gat gaa aac aat aaa ggg ggt tct get cat aac aag cga gag age aga 9 6 
Asp GIu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 ^ 25 30 



aac cac 
Asn H:e 



at t 
lie 

3 5 



Cat 

His 



cat 
His 



caa cag gga 
Gin Gin Gly 
40 



ita ggc cac aag aga aga agg ggt 
Lieu Gly His Lys Arg Arg Arg Gly 
45 



14 4 



ate agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 



192 



lie Ser Gly Ser 
5 0 



Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 



55 



60 



aaa aget 



ggg aac ggt aga aaa cgt tag aga gat ccc aga aga cig 



240 



Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 



70 



75 



SO 



att ttc ate etc ggc gca tec tta ggt gta etc ttg ccg ttt age tct 288 
lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
S5 90 95 



ggc get 
Gly Ala 



tat cat 
Tyr His 
100 



etc cat aat age 
Val His Asn Sei 



gat age gac ttg ttc gac aac ttt 
Asp Ser Asp Leu Phe Asp Asn Phe 
1 n c n - o 







etc cca caa ggt ata agt teg ttt att gat gat att cag get ggt aac 
Leu Pro Gin Gly lie Ser Ser Phe lie Asp Asp lie Gin Ala Gly Asn 
130 135 140 



432 



tac 
Tyr 
145 



tec aca tct tct 
Ser Thr Ser Ser 



tta gat 
Leu Asp 
150 



gat etc 
Asp Leu 



agt gaa 
Ser Glu 
155 



aat ttt gec 
Asn Phe Ala 



gtt ggt 
Vai Gly 
160 



480 



aaa caa etc tta cgc gat tat aat ate gag gee aaa cat cct gtt gta 
Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys His Pro Val Val 
165 170 175 



328 



atg gtt cct ggt gtc att tct acg gga att gaa age tgg gga gtt att 
Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 



576 



gga gac gat gag tgc gat agt tct gcg cat ttt cat aaa egg ctg tgg 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 



624 



gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 



672 



ttg 

Leu 
'225 



aaa cat gta atg 
Lvs His Val Met 



tta cat 
Leu Asp 
230 



cct gaa 
Pro Glu 



aca ggt 
Thr Glv 



rtg gac cca 
Leu Aso Pro 



ccg aac 
Pro Asn 
240 



ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 
Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe lie 
245 250 255 



768 



gca ggg tat tgg att tgg aac aaa gtt 
Ala Gly Tyr Trp He Trp Asn Lys Val 
260 265 



Ctc caa 
Phe Gin 



aat ctg gga 
Asn Leu Gly 
270 



gta act 
Val lie 



SI i 



ggc tat gaa ccc aat aaa atg acg agt get gcg 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala 
275 280 



tat gar tgg 
Tyr Asp Trp 
285 



<^g"g ctt 

Arg Leu 



gca tat tta gat eta gaa aga cgc gat agg tac ttt acg aag eta aag 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 



512 



gaa 
Glu 
305 



caa ate gaa ctg 
Gin He Glu Leu 



ttt cat 
pr : e His 
310 



caa teg 
Gin Leu 



agt ggt 
Ser Gly 
315 



gaa aaa gut 
Glu Lys Val 



tgt tta 
Cys Leu 
320 



960 



att gga cac tct atg ggt tct 
He Gly His Ser Met Gly Ser 
325 



lie Phe 
330 



tac ttt atg 
Tvr Phe Met 



aaa tgg 
Lys Trp 

335 



1008 



gtc gag get gaa ggc cct ctt tac ggt aat ggt 
Val GJu Ala Glu Gly Fro Leu Tyr Gly Asn Gly 
340 345 



ggt cgt ggc 
Gly Arg Gly 
350 



tgg gtt 
Trp Vai 



1056 
aac gaa cac
Asn GIu His 
355 

get cca aag 
Ala Pro Lys 
370 

att caa tta 
He Gin Leu 
3S5 

aga act gag 
Arg He Glu 



atg eta cca 
Met Leu Pro 



tea gag gat 
Ser Glu Asp 
435 

cca ttt gaa 
Arg Phe Glu 
4 50 

aaa cac gec 
Lys Asp Ala 
4 65 

aga aga gta 
Arg Arc Val 



gag cca aga 
Glu Leu Arg 



gaa gta cca 
Glu Val Pro 
515 

?gg gtg aac 
Gly Val Asn 
530 

gac tec tct 
Asp Ser Ser 
545 

gta ttc etc 
Val Phe Leu 



£(-g cat cac 
Met Cys His 



ata gat tea 
He Asp Ser 



gca gtt cca 
Ala Val Pro 



aat acg tta 
Asn Thr Leu 
390 

aga gta aaa 
Arg Val Lys 
405 

aag gga gaa 
Lys Gly Glu 
420 

gca ttg aat 
Ala Leu Asn 



agg aat acg 
A.rg Asn Thr 



att aac atg 
He Asn Met 
470 

cat gag cag 
His Glu Gin 
485 

aaa aac gag 
Lys Asn Glu 
5C0 

ctt cca gaa 
Leu Pro Glu 



aac cca act 
Asn Pro Thr 



get ccg aat 
Ala Leu Asn 
550 

acc gag gg-g 
Thr Glu Gly 
565 

aaa Cgg gee 
Lys Trp Ala 
530 



ttc att aat 
Phe He Asn 
360 

get eta att 
Ala Leu lie 
375 

gec atg tat 
Ala Met Tyr 



dtg tta CaQ. 
Met Leu Gin 



gag gtc att 
Glu Val lie 
425 

aac aac act 
Asn Asn Thr 
440 

age gac get 
Ser Asp Ala 
455 

aca tta teg 
Thr Leu Ser 



tac teg ttc 
Tyr Ser Phe 



cca cac cac 

Leu His His 
505 

get ccc cac 

Ala Pro His 
520 

'gaa agg gca 

Glu Arg Ala 
535 

teg acc acc 

Leu Thr He 



gac gga acc 
Asp Gly Thr 



cag ggt get 
Gin Gly Ala 
585 



gca gca ggg 
Ala Ala Gly 



agt ggc gaa 
Ser Gly Glu 
380 

ggt ttg gaa 
Gly Leu Glu 
395 

acg egg ggt 
Thr Trp Gly 
410 

egg ggg gat 

Trp Gly Asp 



gac aca tac 
Asp Thr Tyr 



ttc aac aaa 
Phe Asn Lys 
46 0 

ata tea cct 
He Ser Pro 
475 

ggc tac tec 
Gly Tyr Ser 
490 

aag cac tgg 
Lys His Trp 



atg aaa ate 
Met Lys He 



tat gta tat 
Tyr Val Tyr 
54 0 

cac cac gaa 
Asp Tyr Glu 



get ccg etc 
Val Pro Leu 
570 

tea ccg tac 
Ser Pro Tyr 



acg ctt ctg 
Thr Leu Leu 
365 

acg aaa gat 
Met Lys A.sp 



aag ttc etc 
Lys Phe Phe 



ggt ata cca 
Gly He Pro 
415 

atg aag tea 
Mec Lys Ser 
430 

ggc aat ttc 
Gly Asn Phe 
445 

aat ttg aca 
Asn Leu Thr 



gaa tgg ccc 
Glu Trp Leu 



aag aat gaa 
Lys Asn Glu 
4 9 5 

teg aat cca 

Ser Asn Pro 
510 

tat tgt ata 
Tyr Cys lie 
525 

aag gaa gag 
Lys Glu Glu 



age aag caa 
Ser Lys Gin 



gcg gcg cac 



aac cct gee 
Asn Fro A. la 
590 



ggc 1104 
Gly 



acc 1152 
Thr 



tea 1200 

Ser 

400 

tea 1248 
Ser 



tct 1296 
Ser 



att 1344 
lie 



acg 1392 
Met 



caa 1440 

Gin 

4S0 

gaa 1483 

Glu 



atg 1536 
Me t 



tac 1584 
Tyr 



gat 1632 
Asp 

cct 1530 

Pro 

5 6 0 

tea 1728 
Ser 



gga 1775 
Gly 
att aac gtt
lie Asn Val 
595 

ata cgt ggt 
lie Arg Gly 
610 

gcg gag ttg 
Ala Glu Leu 
625 

etc gtc gag 
Leu Val Glu 



atg ccc ttc 
Met Pro ?he 



act att gtg 
Thr He Val 



gga gca aaa 
Gly Ala Lys 



aac gat tac 
Asn Asp Tyr 
630 

cca cgc caa 
Pro Arg Gin 
645 

cca atg taa 
Pro Met 
660 



gaa atg aaa 
Glu Met Lys 
600 

age gec gaa 
Ser Ala Glu 
615 

ate ttg aaa 
He Leu Lys 



ttg tct aat 
Leu Ser Asn 



cac cag cca 
His Gin Pro 



cac gta gac 
His Val Asp 
620 

att gca age 
He Ala Ser 
635 

ttg age cag 
Leu Ser Gin 
650 



gat cga ttt 
Asp Arg Phe 
605 

ate etc ggc 
He Leu Gly 



ggc aat ggc 
Gly Asn Gly 



tgg gtt tct 
Trp Val Ser 
655 



gat 1824 
Asp 



age 1872 
Ser 



gat 1920 

Asp 

640 

cag 1968 
Gin 



1986 
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<210> 2 
<211> 661 
<212> PRT 

<213> Saccharcinyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
15 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arc Arg Gly 
35 40 45 

lie Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 " 70 75 80 

lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 95 

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 ~ 120 125 

Leu Pro Gin Gly lie Ser Ser Phe lie Asp Asp lie Gin Ala Gly Asn 
130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
165 170 175 

Met Val Pro Gly Val lie Ser Thr Gly lie Glu Ser Trp Gly Val lie 
ISO 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 " 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 ' 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly ^he Glu Ser Thr Asp Tyr Phe lie 
2^5 250 255 

Ala Gly Tyr Trp lie Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 " 230 285 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 

Gln Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp 
325 330 335 

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly 
355 360 365 

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr 
370 375 380 

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser 
405 410 415 

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser 
420 425 430 

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile 
435 440 445 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 455 460 

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln 
465 470 475 480 

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr 
515 520 525 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro 
545 550 555 560 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp 
595 600 605 
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser 
610 615 620 

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp 
625 630 635 640 

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln 
645 650 655 

Met Pro Phe Pro Met 
660 
<210> 3 

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe 



<400> 3 

atggcgtctt ccaagaagag caaaactcat aagaaaaaga aagaag^caa atctcctatc 60 

gacttaccaa attcaaagaa accaactcgc gctttgag^g agcaaccttc agcgcccgaa 120 

acacaatctg tttcaaataa atcaagaaaa tctaaatttg gaaaaagact gaattttaia 180 

tugggegcta ttctgggaat atgcggtgct ttttttttcg ctgttggaga egacaatget 240 

gttttcgacc ctgctacgtt agataaatet gggaatatgc taggctcttc agacttgttt 300 

gatgacatta aaggatattt accttataat gtgtctaagg atgcaccttt tactaeggae 360 

aagcctccgc agtctcctag eggaaatgaa gttcaagttg gtcttgatat gtacaatgag 420 

ggatatcgaa gtgaccatcc tgctattatg gttcctggtg ttatcagctic aggactagaa 480 

agttggtcgt ttaataatcg ctcgattcct tactttagga aacgtctttg gggtagctgg 540 

tetatgetga aggcaatgtc ccttgacaag caatgetgge ttgaacattt aatgettgat 600 

aaaaaaaccg gcrtggatcc gaagggaatt aagctgegag cagc^caggg gtttgaagca 660 

gctgattttt ttatcacggg ctattggatt tggagtaaag taattgaaaa ccttgctgca 720 

attggttatg agectaataa catgttaagt gcttct^acg attggcggtc ateacatgea 780 

aattcagagg aacgtgataa atattttcca aagttaaaaa tgctcaccga gtacagcaac 840 

acugtacata agaaaaaggt agtgtcgatt Cctcactcca tgggttcaca g-gttaegtae 900 

tattttttta agtgggctga agctgagggc tacggaaacg gtggaccgac tcgggttaat 960 

gatcatattg aagcatccat aaatgtgagc ctcgatggtc gt:tgactac grttctiaact 1020 

tttgaataga tategggate ttcgattgga gcacccaaaa cagtggcagc gcticctatcg 1080 

ggtgaaacga aagatacagg tategtaate acatcaaaca tgttaatatt taatttttgc 1140 

taaccgcttt aageccaatt gaatcagttt aeggtetacg ggtaagcaat aaattgttga 1200 

gatttgtcac taattcactg taLagtrtga aaaaactttt tzcccgtcct gagg T aai:act 1260 

caaaaataca aatgtgctct ac tt tttcta acttttaata gagagecatg atggttcgca 1320 

ctatgggagg agttagttct: atgettccta aaggaggega tgttgtatgg ggaaatgeca 1380 

g^tgggcaag aaaaatgtgc tgttaatctt ttattaatat ttaggcrcca gatgacctta 1440 

atcaaacaaa Cttttccaat ggtgcaatta ttcgacatag agaagacact gataaggacc 1500 

acgatgaatc tgacatagat gatgeattae aatttttaaa aaatgttaca catgacgact 1560 

ttaaagtcac getagegaaa aattattccc aeggtcttge teggactcaa aaagaagcgt 1620 

taaaaaataa egaaatgecg tctaaaagga taaatccget agaagtaaga acattaaagt 1680 

tactaaatra tactaaccca aatagactag tcctccttac gcccctgata tgaaaatcta 1740 

CCccgrtcac ggggtcggaa aaccaactga gagaggttat tattatacta aeaatcctga 1800 

ggggcaacct gtcattgatt cctcggctaa tgatggaaca aaagt:gaaa aegegagaga 1860 

atr.tatgc.tt caaacattc: a~taactgtt ttattagggt attgttatgg atgatggtga 1920 

eggaacctta ccaatattag cccttggttt ggtgtgcaat aaagtttggc aaacaaaaag 1980 

gtttaaccct gcuaatacaa gcaccacaaa tcatgaaatc aagcatgaac ctgctgcgtt 2040 

tgatcugaga ggacgacctc geteggcaga acaegtcgat: acacctggac attcacagct 2100 

aaatgtatgt tcatttcacc ttacaaatt: ctattactaa ctcttgaaat aaggaaatta 2160 

ttttaaaagt ttcatcaggc caccgtgact eggtaccaaa ccgttataca tcagacatcc 2220 

ageaeggaca taagttctgt agattgeaat taactaacta accgaacagg gaaataacaa 2280 

atgagataaa ccccgataaa cctagaaatt aa 2312 
<210> 4 

<211> 3685 

<212> genomic DNA 

<213> Arabidopsis thaliana 

<400> 4 

acgcccctta ttcaccggaa aaagccgacg cagaaaccat cgacgccgcc acctgaagag 60 
ctggtgcacg atgaggattc gcaaaagaaa ccacacgaat cttccaaatc ccaccataag 120 
aaatcgaacg gaggagggaa gtggtcgtgc atcgattctc gttgttggtt cattgggugt 180 
gtgtgtgtaa cctggtggtc tcttctcttc ccttacaacg caatgcctgc gagctitccct 240 
cagtatgtaa cggagcgaac cacgggtcct ttgcctgacc cgcccggugt taagctcaaa 300 
aaagaaggtc ^taaggcgaa acatcctgct gtcttcatcc ctgggattgt caccggtggg 360 
ctcgagct^t gggaaggcaa acaacgcgct gatggtit tat ttagaaaacg tttgtggggt 420 
ggaacttttg gugaagtcta caaaaggtga gctcaacaat tctcactctt cctttatact 480 
gggatttgga tcggatctga tgagatcacg cacttgttgc ttcttcaaca tcacccaaac 540 
1 ;rcaattcca tgtttgtctg tcttactctt tacttttttt z t 1 1 1 ttgat gtgaaacgct 600 
attttcttaa gagaccattt ctgtaugtgt aaggtaagcg ttccaaggac gtaatcggct 660 
tgoactattt ctgtttgatt gtcaacttta ggatauaaaa cagctgccte ggaactccaa 720 
gtcatctcac tgccaaatct gttgccagac atgccctaga gtccgttcat aacaagttac 780 
-icctttacc gucgttgcgt gtagattcag ctttgngtag cctataacga agtagtgttt 840 
catgttLcgt tgggaataga gaagttctaa ctacatctgt ggaaagtgtg ttcaggctgt 900 
gatagaggac cgti gctt ta ctattcaact atgtatatgt giaattaaag ctagttcctt 960 
t::gacccLt cagctcaatg tgcrtttctc aatttttttc tcaattccaa agL^tcacat 1020 
cgagttcatc cacatgtctt gaatctcgtc catcctcgit ccgitaccca get tcgaacc 1080 
cctcccgacc ctgctatgga cacattaaaa aaaaagtgtc t cgtgggt eg cacctctgct: 1140 
acgat-tgca tcttcttcLt tcggctcagt gctcatgtct ticgccacggt agagatigggc 1200 
aatgttactg ^tgacggtaa cagcgctaca gttgatagta ccccaaccaa ccaattatc- 1260 
c^cgacLca ggcctctacg tcgggnggaa cacatgtcac ttgacaatga aactgggttg 1320 
gacccagctig guattiagagt tcgagctgca tcaggactcg tggctgctga ctactttgct 1380 
cctggctact ttgtctgggc agtgctgact gccaaccttg cacacattgg acatgaagag 1440 
aaaaatacgt acatggctgc atatgaccgg cggctctcgc ttcagaacac agaggttc^c 1500 
ttcneatcgt tcttcctact attctgtccc atgttacctt tctatcttca ctaccnaagg 1560 
ct^aaatacg tttcacgttg aattaacagg tacgcgatca gactc:;agc cgi:atgaaaa 1620 
gcaatacaga gttgacggt;: :ctaccaacg guggaaaaaa agcagtcata gctccgcact 1680 
ccatgcgggt cttgtatttt ctacatttta cgaagtgggu tgaggcacca gctccnccgg 1740 
gtggcgggcg tgggccagac tggtg:gcaa agtacatcaa ggcggcgatg aacatcggtg 1800 
gaccattccc tggcgtccca aaagccgttg cagggctttt ctctgccgaa gcaaaggacg 1860 
ttgcagttgc caggtaccga atatccgctc atiacttttga tgaccagaac CLrggctctg 1920 
gaaczcaaag ctattctact aaataccaac cctaacaaca t^gcuatact atcgctgcaa 1980 
ctgacatcgg tzgatcacct ttgccgctta tgnaaccgaa actctcttga gactagacaa 2040 
atgacgaatc gataat:c:c acgcattgcc ctgrgatgac cagtcLctita gcctcgacga 2100 
:aacaCLi:gt cacactgcct: tttggagggc aatgaatttc gctatggaaa gcgccggagc 2160 
t:ccatgc:c gcatccttta ccaa^tagcg tcattctgct cctttcaatt tcctcgtata 2220 
tgcacctacg gccctttatc cct:ctcaan taaagactcg tcggattagt cgctctatca 2280 
gccacttggr ccctraatau agaactttac tctcttcgaa aattgcagag cgattgcccc 2340 
aggatcccia gacaccgata tattcagact :cagacc'ccg cagcaegtaa cgagaatgac 2400 
acgcacatgg gacccaacaa cgcctatgcc accgaaggga ggtgacacga tatggggcgg 2460 
gcttgattgg tcaccggaga aaggccacac ctgttgccgg aaaaagcaaa agaacaacga 2520 
aacccguggt caaccaggtc aaaacggagc ntccaagaaa agccctgita actacggaag 2580 
gacgatazct: ttcgggaaag aagtagcaga ggccgcgcca cccgagatta acaatat-iga 2640 
zzztcgagza aggacacata aatcataaca aaccttgtac a:tccgcgac tgtatgatga 2700 
ctacctgiac atcttatctg g:gaacggtg ctgtcaaagg tcagagcatc ccaaatcaca 2760 
:ctgrcgtga cgtgcggaca gagcaccacg acatgggaat tgctgggatc aaagccaacg 2820 
c-gag:a;aa ggzcZaca.cz gctggugaag ccatacaccc accacatca: gtzgczccza 2880 
agacgatggc gcgtggcgcc gcccacct:: cccacggaac cgc:gacgac ccggacgaca 2940 
ccaagcacca agaccccaaa Zaczggzcaa atccgttaga gacaaagcaa gcgacctctt 3000 
gattccaact gtazcczzzg ccccgatgca ccatcagic: L^tcgct^^c ggtcttgccg 3060 
gacatgctct ccagcccaaa gcttacaaag ccgcttctga qcczzzczca aaaaggc::g 3120 
c~caataaca ttgaggtgct aaag;:cga:a cacgtgactc ttgcttataa azcccccgzz 3180 
tggcttgttc tgctttttca gattaccgaa tgctcctgag atggaaatct actcattata 3240 
cggagtgggg ataccaacgg aacgagcata cgtatacaag cctaaccagt ctcccgacag 3300 
tcgcaLcccc tttcagatat tcacttctgc tcacgaggag gacgaagata cctgtctgaa 3360 
agcaggagtt uacaatgtgg atggggatga aacagtaccc gtcctaagtg ccgggtacat 3420 
gtgtgcaaaa gcgtggcctg gcaagacaag atccaaccct tccggaatca agacttatat 3480 
aagagaatac aatcacnctc cgccggctaa cccgttggaa gggcgcggga cgcagagtgg 3540 
tgcccatgnc gatatcacgg gaaactttgc ttugatccaa catatcatga gggttgccgc 3600 
cggaggtaac gggtctcata taggacatga ccaggtccac tctggcatzat ctgaatggtc 3660 
ggagcgtatt gacctgaagc tgtga 3685 



BASF-NAE 33 77/99 PCT 11/58 12.09.2000 



<210> 5 
<211> 2427 
<212> cDNA 

<213> Arabidopsis thaliana 



<400> 5 

agaaacagct ctttgtctct ctcgactgat ctaacaatcc ccaatctgtg ttctaaattc 60 
ctggacgaga tttgacaaag teegtatage ttaacctggt ttaatttcaa gcgacagata 120 
tgccccttat tcatcggaaa aagecgaegg agaaaccatc gacgccgcca tctgaagagg 180 
tggcgcacga tgaggatteg caaaagaaac cacacgaatc ttccaaatcc caccataaga 240 
aatcgaacgg aggagggaag tggtcgtgca tcgattcttg ttgttggttc attgggtgtg 300 
tgtgtgtaac ccggtggttt cttctcttcc tttacaaege aacgcctgcg agcttccctc 360 
aguatgtaac ggagegaate acgggtcctc tgcctgaccc gcccggtgtt aagctcaaaa 420 
aaagaaggtc ttaaggcgaa acatcctgtt gtcttcattc ctgggattgt caccggtggg 480 
ctcgagcttt gggaaggcaa acaatgeget gatggtttat ttagaaaacg 11tgtggggt 540 
ggaacttttg gtgaagtcta caaaaggect ctatgttggg eggaacacat gtcacttgac 600 
aatgaaactg ggttggatcc agctggtatt agagttcgag ctgtatcagg actcgtgccc 660 
gctgactact ttgctcctgg ctactttgtc tgggcagtgc tgattgctaa ccttgcacat 720 
attggatatg aagagaaaaa tatgtacatg getgeatatg actggegget ttcgtctcag 780 
aacacagagg cacgtgacca gacccttagc cgtatgaaaa gtaatataga gttgatggtt 840 
cctaccaacg gtggaaaaaa agcagttata gttcegcatt ccatgggggt cttgtatcct 900 
ctacattcta tgaagtgggt tgaggcacca gctcccctgg gtggcggggg egggecagat 960 
tggtgtgcaa agtatattaa ggcggcgatg aacattggtg gacca::tct tggtgctcca 1020 
aaagctgt:tg cagggctttc etctgccgaa gcaaaggatg ttgcagttgc cagagcgacc 1080 
gccccagga: Ccctagacac cgatacattt agacttcaga ccttgeagea ngcaatgaga 1140 
acgacacgca cacgggaccc aacaatgee: atgttaccga agggaggtga cacgatatgg 1200 
ggcgggcttg attggccacc ggagaaaggc cacacctgtt gtgggaaaaa gcaaaagaac 1260 
aacgaaactt gtggtgaagc aegegaaaac ggagtttcca agaaaagtcc tcttaactat 1320 
ggaaggatga tatcttctgg gaaagaagta gcagaggctg cgccatctga gatnaataat 1380 
attgattttc gaggtgctgt caaaggtcag agtatcccaa atcacacctg tcgtgacgtg 1440 
tggacagagt accacgacat gggaattget gggatcaaag ctategctga gtacaaggtc 1500 
tacactgctg gtgaagctat agatctacta cattatgttg ctcctaagat gatggcgcgt 1560 
ggtgccgctc atttctctta tggaaccgct gatgatttgg atgacaccsa gtatcaagat 1620 
cccaaatact ggtcaaatcc gttagagaca aaattaccga atgetcctga gaeggaaace 1680 
tactcaccat acggagtggg gacaccaacg gcj.ocgci.crcc aegtatacaa gettaaccag 1740 
tcrcccgaca gttgcatccc etctcagaca ttcacttcrg ctcacgagga ggacgaacac 1800 
agctgtctqa aagcaggagt tnacaatgtg gatggggacg aaacagtacc cgtcccaagc 1860 
gccgggtaca tgtgtgcaaa agcgtggcgt ggcaagacaa gatccaaccc teceggaate 1920 
aagacctata taagagaata caaccactcc ccgccggcta acccgttgga agggegeggg 1980 
acgcagagtg gcgcccacgt tgatatcatg ggaaaccttg ctttgatcga agacaccatg 2040 
agggctgccg ccggaggcaa egggtctgat acaggacatg accaggtcca ctccggcaca 2100 
tttgaatggt eggagegtat tgacctgaag ctgegaatat catgarccct ttaagctgtc 2160 
ctgtcagctt atgtgaatcc aatactttga aagagagatc atcatcaatc catcaccatc 2220 
gtcaccatca tgatgcccaa ctcacaaaga agectgagaa tgatactttg ctgcgaaat: 2280 
ctcaatacct ctttaatatt cttactgaac gcaaaetata caatcccetc taatgtttga 2340 
acgataacac aaaacttget gengecatgt ttgtttgece tcrtcaaaacc stcaa11tgt 2400 
gggttaaaaa aaaaaaaaaa aaaaaaa 2427 






<210> 6 
<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 6 

Met Pro Leu Ile His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro 
1 5 10 15 

Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gln Lys Lys Pro His 
20 25 30 

Glu Ser Ser Lys Ser His His Lys Lys Ser Asn Gly Gly Gly Lys Trp 
35 40 45 

Ser Cys Ile Asp Ser Cys Cys Trp Phe Ile Gly Cys Val Cys Val Thr 
50 55 60 

Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro 
65 70 75 80 

Gln Tyr Val Thr Glu Arg Ile Thr Gly Pro Leu Pro Asp Pro Pro Gly 
85 90 95 

Val Lys Leu Lys Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe 
100 105 110 

Ile Pro Gly Ile Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gln 
115 120 125 

Cys Ala Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Gly 
130 135 140 

Glu Val Tyr Lys Arg Pro Leu Cys Trp Val Glu His Met Ser Leu Asp 
145 150 155 160 

Asn Glu Thr Gly Leu Asp Pro Ala Gly Ile Arg Val Arg Ala Val Ser 
165 170 175 

Gly Leu Val Ala Ala Asp Tyr Phe Ala Pro Gly Tyr Phe Val Trp Ala 
180 185 190 

Val Leu Ile Ala Asn Leu Ala His Ile Gly Tyr Glu Glu Lys Asn Met 
195 200 205 

Tyr Met Ala Ala Tyr Asp Trp Arg Leu Ser Phe Gln Asn Thr Glu Val 
210 215 220 

Arg Asp Gln Thr Leu Ser Arg Met Lys Ser Asn Ile Glu Leu Met Val 
225 230 235 240 

Ser Thr Asn Gly Gly Lys Lys Ala Val Ile Val Pro His Ser Met Gly 
245 250 255 

Val Leu Tyr Phe Leu His Phe Met Lys Trp Val Glu Ala Pro Ala Pro 
260 265 270 

Leu Gly Gly Gly Gly Gly Pro Asp Trp Cys Ala Lys Tyr Ile Lys Ala 
275 280 285 
Val Met Asn Ile Gly Gly Pro Phe Leu Gly Val Pro Lys Ala Val Ala 
290 295 300 

Gly Leu Phe Ser Ala Glu Ala Lys Asp Val Ala Val Ala Arg Ala Ile 
305 310 315 320 

Ala Pro Gly Phe Leu Asp Thr Asp Ile Phe Arg Leu Gln Thr Leu Gln 
325 330 335 

His Val Met Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu 
340 345 350 

Pro Lys Gly Gly Asp Thr Ile Trp Gly Gly Leu Asp Trp Ser Pro Glu 
355 360 365 

Lys Gly His Thr Cys Cys Gly Lys Lys Gln Lys Asn Asn Glu Thr Cys 
370 375 380 

Gly Glu Ala Gly Glu Asn Gly Val Ser Lys Lys Ser Pro Val Asn Tyr 
385 390 395 400 

Gly Arg Met Ile Ser Phe Gly Lys Glu Val Ala Glu Ala Ala Pro Ser 
405 410 415 

Glu Ile Asn Asn Ile Asp Phe Arg Gly Ala Val Lys Gly Gln Ser Ile 
420 425 430 

Pro Asn His Thr Cys Arg Asp Val Trp Thr Glu Tyr His Asp Met Gly 
435 440 445 

Ile Ala Gly Ile Lys Ala Ile Ala Glu Tyr Lys Val Tyr Thr Ala Gly 
450 455 460 

Glu Ala Ile Asp Leu Leu His Tyr Val Ala Pro Lys Met Met Ala Arg 
465 470 475 480 

Gly Ala Ala His Phe Ser Tyr Gly Ile Ala Asp Asp Leu Asp Asp Thr 
485 490 495 

Lys Tyr Gln Asp Pro Lys Tyr Trp Ser Asn Pro Leu Glu Thr Lys Leu 
500 505 510 

Pro Asn Ala Pro Glu Met Glu Ile Tyr Ser Leu Tyr Gly Val Gly Ile 
515 520 525 

Pro Thr Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gln Ser Pro Asp Ser 
530 535 540 

Cys Ile Pro Phe Gln Ile Phe Thr Ser Ala His Glu Glu Asp Glu Asp 
545 550 555 560 

Ser Cys Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val 
565 570 575 

Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys 
580 585 590 

Thr Arg Phe Asn Pro Ser Gly Ile Lys Thr Tyr Ile Arg Glu Tyr Asn 
595 600 605 
His Ser Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gln Ser Gly 
610 615 620 

Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile Glu Asp Ile Met 
625 630 635 640 

Arg Val Ala Ala Gly Gly Asn Gly Ser Asp Ile Gly His Asp Gln Val 
645 650 655 

His Ser Gly Ile Phe Glu Trp Ser Glu Arg Ile Asp Leu Lys Leu 
660 665 670 
<210> 7 
<211> 643 
<212> cDNA 
<213> Zea mays 

<220> 
<221> CDS 
<222> (1)..(402) 

<400> 7 

ccg aaa aaa ata act get ttg aag ggg ggt gtt tac tea gec gat gg L 48 
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
1 5 10 15 

gat gaa act gtt cca gtt ctt agt gcg ggc tac atg tgt gcg aaa gga 96 
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

cag cgt gac aaa act cgt ttc age cct gec ggc age aag act tac gtg 144 
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

aga aaa tac age cat teg cca ccc tct act etc ctg gaa ggc agg ggc 192 
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

acc caa age agt gca cat get gac aca a:g ggg aac ttt get eta ate 240 
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile 
65 70 75 80 

gag gac gtc ate aga ata get get ggg gca acc ggt gag gaa act ggt 288 
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly 
85 90 95 

ggc gat cag gtt tat tea gat ata tec aag tgg cca gag aaa ate aaa 336 
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys 
100 105 110 

ttg aaa ccg caa cct atg gga agt Caa aga agt gec gac ccg ttc act 384 
Leu Lys Leu 
115 

gca ttc caa agt gtc ccg cccgagcgca actceggatt ttgcttaaat 432 
attgtaatct ctcacgcttc attcgtcccc tegtcaaate tacatttgac aggacgecaa 492 
tgegataega tgtcgtaccg ctattctcag catcgcacac taaactgeae aggegtaage 552 
cgcattcgcc agcCgaaatc gtgtagccgt ttcctctacg atttaatanc aagtggcgga 612 
acagtccccc aagcnaaaaa aaaaaaaaaa ci 643 
<210> 8 
<211> 115 
<212> PRT 
<213> Zea mays 

<400> 8 

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly 
1 5 10 15 

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly 
20 25 30 

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val 
35 40 45 

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly 
50 55 60 

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile 
65 70 75 80 

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly 
85 90 95 

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys 
100 105 110 

Leu Lys Leu 
115 
<210> 9 
<211> 616 
<212> cDNA 

<213> Neurospora crassa 



<400> 9 

ggcggcgaag acganggcgg aagttggagg ctaacgagaa tgacnctcgg agatggacct 60 
accctctaga gacacgacta ccnttgcacc cagcctcaag gtntacngtt cntatgggta 120 
ggaagccgac ggagcgagcc tacatctacc tggcgcccga tcccgggacg acaacgcacc 180 
tttagatgac gatcgatacg actttgactn aggggcacat tgaccacggt gtgattttgg 240 
gcgaaggcga tggcacagtg aacctcatga gtttggggta cctgtgcaat aaggggcgga 300 
aaatgaagag atacaatcct gcgggctcaa aaataaccgt ggtcgagatg ccgcatgaac 360 
cagaacggtt caatccgaga ggsigggccga atacggcgga tcacgtggat attctaggaa 420 
ggcagaatCL aaacgagtac attcttaaag tggcggcagg ccgaggcgat acaattgagg 480 
attccatcac tagtaatact cttaaatatg tagaaaaggt tgaaatteat gaagagtaat 540 
Laaacacggc acataggtta ctcaatagta tgactaatta aaaaaaaatt ttttttctaa 600 
aaaaaaaaaa aaaaaa 616 
<210> 10 

<211> 1562 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 10 

atgaaaaaaa tatcttcaca ttattcggta gtcatagega tactcgrtgt ggtgacgatg 60 
acctcgatgt gtcaagctgc gggtagcaac gtgtaccctt tgattctggt tccaggaaac 120 
ggaggtaacc agctagaggt acggctiggac agagaataca agecaagtag tgtctggtgt 180 
agcagctggt tatatccgat tcataagaag agtggtggat ggtttaggct atggttcgac 240 
gcagcagtgc tattgtctcc ctccaccagg tgctteageg atcgaatgat gttgtactat 300 
gaccctgatc tggatgacta ecaaaatget cctggtgtcc aaacccgggt tcctcatttc 360 
ggtucgacca aatcacttct ataccccgac cctcgtctcc cgttagtact ttccaagata 420 
tatcattttg ggacatttgc ataatgaaca aaatagacac aaatttgggg gattattgtt 480 
atatcaatat ccatctacat getagteggt aatgtgagtg ttatgttagt atagttaatg 540 
tgagtgttat gtgattttec attttaaatg aagctagaaa gttgtcgttt aataatgttg 600 
ctatgtcatg agaattataa ggacactatg taaatgtagc ttaataataa ggttcgattc 660 
gcagagatgc cacaccttac atggaacatt tggtgaaagc tctagagaaa aaatgcgggt 720 
atgttaacga ccaaaccatc ctaggagctc caeatgat11 eaggtaegge ctggctgctt 780 
cgggccaccc gtcccgtgta gcctcacagt tcctacaaga ccteaaacaa ttggtggaaa 840 
aaactagcag cgagaacgaa ggaaagccag tgatactcct ctcccatagc ctaggacgac 900 
ttttcgtcct ccatctcctc aaccgtacca ccccztcatg gcgccgcaag LacatCcaac 960 
accttgutgc actegetgcg ccatggggtg ggacgatetc tcagatgaag acacctgett 1020 
ctggcaacac actegctgtc cctttagtta accctttget ggtcagaegg catcagagga 1080 
cctccgagag taaccaatgg ctacttccat ctaccaaagt gtttcacgac agaactaaac 1140 
cgctcgtcgt aactccccag gttaactaca cagcttacga gatagategg 111tttgcag 1200 
acatcggatt ctcacaagga gttgtgcccc acaagaeaag agtgctgect ttaacagagg 1260 
agetgatgac tccgggagtg ccagtcactt ecatataegg gagaggagtt gatacacegg 1320 
aggttttgat gtaeggaaaa ggaggatteg ataagcaacc agagattaag tatggagatg 1380 
gagaegggae ggttaatttg gegagettag cacctttgaa agtcgatagc ttgaacaccg 1440 
tagagattga tggagtttcg catacatcta tacttaaaga cgacatcgea c11aaagaga 1500 
ttatgaagca aatttcaatt attaattatg aattacrccaa tgttaatgee gtcaatgaat 1560 
ga 1562 
<210> 11 

<211> 3896 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 11 

atgggagcga 

atttgcggtg 

c tatcggcta 

gactgtccat 

aaggtccgcg 

aattccaagc 

gtcaacagtg 

tgtt tcccat 

Lttttattca 

ggtcatttta 

tggtttacag 

taatcaaaca 

agaatcggat 

tcaatetgat 

tcaggtcctc 

gaagcaaacg 

gagcgtgacc 

ccccctt tt t 

t Ltgaaacr g 

aataatgtct 

t tgaagtggc 

t taagttacc 

agctcgatac 

tcaaatctac 

tctctttagt 

tgaaagtatt 

agtgccagtt 

caaccaaacc 

gaagaaactt 

tgtgacatat 

ggt tgttgtc 

sggg tgataa 

tacaccac^g 

ttaacat tga 

aaaagtt tea 

tacagaaaca 

tCLcacagcc 

agatagcaag 

rattgctcat 

ctatatcaat 

ccctat:ac t 

acgaatgcaa 

taat agcgcc 

a:actttcca 

ccacctataa 

ancctcaaia 

cccLCLtgaa 

c taaatgtat 

taacctgata 

agcnaatuzc 

gtgccatgtt 

:tat :a:ccc 

tgaaaca tga 



attcgaaatc 
gccgaactgc 
taatcat tec 
acactccgtt 
atcttcattt: 
gaaatatagc 
acgcttctga 
cgcttttggt 
actagatccg 
cttgtgcact 

gaccatcccg 
ccaggtcaca 
atcatcutgt 

caat tgtcgc 
tttact^tca 
atgtaagata 
ctttaaaact 
tcagataccc 
tcgatcagca 
attctauct 
ctgat ttgtc 
tctctcnggc 
at caagcagt 
actcctgc na 
atcaaagaac 
acatgtacac 
tgactgaaau 
cgacatgcgc 
caattctt cc 
cacat tctgg 
tgatgaagag 
aat tcctccc 
ccaagaatgt 
■ gctctagtca 
cgtgaaccag 
aggatgt t ac 
cgatcaccac 
tgctcacatg 
gat tatcagt 
aagggggtat 
aatttcgt tt 
ggttgtatca 
aaaatgcac: 
tcacat tatg 
ttatgattta 
gaagczgtcc 
actggazca: 
ccgcaacggc 
acc - :cag::g 
rztcgagcc t 
cactcgtacc 



agtaacggct 

ggcggaggat 

gggatttgcg 

ggact tcaat 

ccttcgctcc 

aatgaagcat 

atctgagctt 

tcgctaaatg 

caagtttccc 

gtgattcttt 

ctgccaactg 

agtgtaagtc 

taacaggcag 

cgtgatat aa 

ctggaaagag 

tgttccacac 

caaccccaag 

agccaagagc 

ccgtggcggc 

tctggaatgg 

tacecatgct 

ccetccaatt 

gccgat ccag 

gaaacgtttg 

tgatatcaac 

at tgaactgc 

acattgtggg 

tgattcagtc 

cttcttattt 

Ltctcatatt 

gcgtcgtcat 

acgcatt t tc 

gaatat caat 

actagcggtt 

tcactc teat 

aca tgaccag 

cagatgggac 

accagccaaa 

ccgctggctt 

ctt tgtct tt 

tctctcct ta 

c ttagtcgat 

t ic tagcaaa 

tgatgaccct 

t tgcat at at 

cgt tgact t c 

ccttctccct 

gt tataggc t 

caeggatate 

agaag taaaa 

cataagcaaa 

agtgga tgat 

aaagat aact 



tcct tcaccg 
gagaccgagt 
tcgacgcagc 
ccgctcgacc 
ttattctgtc 
gtctcgt etc 
agagtcatat 
tagegcaatg 
agagtgc tea 
tggt tgt tgc 
ctggtttaag 
acggcctgac 
t ttcggattt 
tatggctaag 
tggcttaagt 
gateggagat 
c tagtcct ta 
cccggccgtc 
cc ctctatac 
ctgaggc tag 
tacttccctg 
gggggagtta 

gagctcctct 
gccttcc tgt 
caggtct tat 
tgtacgegat 
tagtiatacct 
ttcagat tat 
taataggcta 
tat tat tggt 
tgtggct tat 
ctgggggtgc 
caaaata t tc 
agactctgta 
atttcgt tec 
catggaa tgt 
tcttt tcaaa 
gaagtacgta 
c ttgtacgtc 
ct tactataa 
tat tatggaa 
tgattctctc 
tetctgtgaa 
gc t tttaatc 
ggtgc taatc 
gttat ta ta t 
tgcatcttat 
ggttatt act 
at ttatgaaa 
caggaaggca 
t tat taaaca 
cagtggc tta 
agcaaaacaa 



tcatcgccgt 
ttcaeggega 
tacgagegtg 
tegtatgget 
ggtcgagtca 
ccttat tgat 
aaaacagctg 
aa tgtgtaaat 
atagtagt ta 
ttactcatcg 
tgtacggtgc 
agtggtcttt 
ttct ttctcc 
ttcattaatt 
ggtgtgttga 
tgtcaccaac 
tcaggc taat 



catt tgecca 
aaat tgeace 
c tggtacccg 
tgccgtgac t 
ret tggttct 
t tctgaggtg 
aac tcactgg 
aeggtate tg 
cccagcggcc 
t at ggtagac 
teat t tec t t 
aaggc t tcag 
gecat tt tea 
t gcaaagaaa 
tggctggccg 
tatgeaactg 
tt tcatgtgc 
ggect tccca 
gcaat agaag 
cc tttctt tg 
aaattgtt tt 
gaaacaagta 
tgtctttttc 
attctctagt 
1 1 a t a t a t a a 
etc tgact cc 
taaagacaga 
zccccat 1 1 g 
gctat taagc 
t tgccccaag 
c tgaaggt t c 
aagtct tc tg 
ac taaaat 1 1 
aagtgggaag 
aactaaccca 



ttt tttcttg 
ctactcgaag 
ctcgatcctt 
agacaccacc 
cttgttgatg 
tegttcatta 
acteggegag 
tagtccgcgc 
gaaaatgt ta 
acgtgatgga 
tagatccLta 
cagccatcac 
tgagc t ttct 
tggtcaatt t 
gut tggtata 
caaat tggaa 
gtcttt tatc 
caggtcgacc 
ctcaatgggt 
aaaacat ta t 
cct actatcc 
tactggattg 
gt tgaggcaa 
acctctgac t 
attttcctt t 
tagatct tga 
tt age taat a 
tttaag ttga 
actgaaauca 
ggaac tget c 
aagaat tgea 
gat aagcgcg 
acaaatatt a 
caacactaac 
atccatcag t 
ccc 1 1 1 tgtc 
accatgaccc 
tgataagaaa 
gt t taaatc t 
taatcagaaa 
gtt tacagt t 
ttgc 1 1 tgac 
catgc t aac t 
t tgegagaga 
ggtat gatgc 
gt t tgcaat a 
c: taaaggta 
tgccaaacc t 
cctcgtgtca 
tatcagtct a 
aagtactt ct 
acgtgt t gca 
t etc tcaat t 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
tcatactatt aggagtagtc gtgettttaa aaaatttgtt ttaagaaacc gaaaaactag 3240 
ttcatatctt gattgtgcaa tatctgeagg tctggaactg tggttgatgg gaaegetgga 3300 
cctataactg gggatgagac ggtaagctca gaagttggta ttgaaattac ettcttgeaa 3360 
actactgaag actaagataa tacctgcttc tggaacactg cttgctatgt tctctagtac 3420 
accgcaatat tgactctccg ctacttttac tgattatgaa attgatctct tataggtacc 3480 
ctatcattea ctctcttggt gcaagaattg geteggaect aaagttaaca taacaatggc 3540 
tccccaggta ctcnttttca gttcctca.ee ttatatagat caaactttaa gtgtactttt 3600 
ctggttatgt gttgatttac ctccaatttg ttctttctaa aaatcatata tctctgtact 3660 
ccccaagaac ttgtattaat ctaaacgaga ttctcattgg gaaaataaaa caacagccac 3720 
aacacgatgg aagcgacgta catgtggaac taaatgttga tcatgagcat gggtcagaca 3780 
tcatagccaa catgacaaaa gcaccaaggg ttaagtacat aaccttttat gaagactctg 3840 
agagcattec ggggaagaga accgcagcct gggagcttga taaaagtggg tattaa 3896 

<210> 12 
<211> 709 
<212> cDNA 

<213> Lycopersicon esculentum 



<400> 12 

ctggggccaa aagtgaacat aacaaggaca ccacagtcag agcatgatgt tcagatgtac 60 
aagtgcacct aaatatagag catcaacatg gugaagatat cattcccaat atgacaaagt 120 
tacctacaat gaagtacata acctattatg aggattctga aagtnttcca gggacaagaa 180 
cagcagtttg ggagcttgat aaagcaaatc acaggaacat tgtcagatcti ccagccttga 240 
tgcgggagct gtggcttgag atgtggcatg atatccatcc tgataaaaag tccaagtctg 300 
ttacaaaagg tggtgtctga tcctcactat tctcctctat aaatgtttga gtttgtattig 360 
acattgtaag tattgcaaca aaaagcaaag cgtgggcctc tgagggatga ggactgctat 420 
tgggattacg ggaaagctcg atgtgcatgg gctgaacatt gtgaacacag gttagaatat 480 
tcaaattata ttttgcaaaa tattctcttt ttgtgtattt aggccacctt tccccggcca 540 
caacgatgca gatatgtatt cggggatgt.t cacctgggac agagttgcag attgaagagt 600 
tctacacctc acatcctgtc acactatgtg tgatatttaa gaaacctcgt tnggcggaac 660 
aacaagttcg cacaaacact tgaagaagaa agcgaaatga ttcagagag 709 
<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 13 

Met Ala Ser Ser Lys Lys Ser Lys Thr His Lys Lys Lys Lys Glu Val 
1 5 10 15 

Lys Ser Pro Ile Asp Leu Pro Asn Ser Lys Lys Pro Thr Arg Ala Leu 
20 25 30 

Ser Glu Gln Pro Ser Ala Ser Glu Thr Gln Ser Val Ser Asn Lys Ser 
35 40 45 

Arg Lys Ser Lys Phe Gly Lys Arg Leu Asn Phe Ile Leu Gly Ala Ile 
50 55 60 

Leu Gly Ile Cys Gly Ala Phe Phe Phe Ala Val Gly Asp Asp Asn Ala 
65 70 75 80 

Val Phe Asp Pro Ala Thr Leu Asp Lys Phe Gly Asn Met Leu Gly Ser 
85 90 95 

Ser Asp Leu Phe Asp Asp Ile Lys Gly Tyr Leu Ser Tyr Asn Val Phe 
100 105 110 

Lys Asp Ala Pro Phe Thr Thr Asp Lys Pro Ser Gln Ser Pro Ser Gly 
115 120 125 

Asn Glu Val Gln Val Gly Leu Asp Met Tyr Asn Glu Gly Tyr Arg Ser 
130 135 140 

Asp His Pro Val Ile Met Val Pro Gly Val Ile Ser Ser Gly Leu Glu 
145 150 155 160 

Ser Trp Ser Phe Asn Asn Cys Ser Ile Pro Tyr Phe Arg Lys Arg Leu 
165 170 175 

Trp Gly Ser Trp Ser Met Leu Lys Ala Met Phe Leu Asp Lys Gln Cys 
180 185 190 

Trp Leu Glu His Leu Met Leu Asp Lys Lys Thr Gly Leu Asp Pro Lys 
195 200 205 

Gly Ile Lys Leu Arg Ala Ala Gln Gly Phe Glu Ala Ala Asp Phe Phe 
210 215 220 

Ile Thr Gly Tyr Trp Ile Trp Ser Lys Val Ile Glu Asn Leu Ala Ala 
225 230 235 240 

Ile Gly Tyr Glu Pro Asn Asn Met Leu Ser Ala Ser Tyr Asp Trp Arg 
245 250 255 

Leu Ser Tyr Ala Asn Leu Glu Glu Arg Asp Lys Tyr Phe Ser Lys Leu 
260 265 270 
Lys Met Phe Ile Glu Tyr Ser Asn Ile Val His Lys Lys Lys Val Val 
275 280 285 

Leu Ile Ser His Ser Met Gly Ser Gln Val Thr Tyr Tyr Phe Phe Lys 
290 295 300 

Trp Val Glu Ala Glu Gly Tyr Gly Asn Gly Gly Pro Thr Trp Val Asn 
305 310 315 320 

Asp His Ile Glu Ala Phe Ile Asn Ile Ser Gly Ser Leu Ile Gly Ala 
325 330 335 

Pro Lys Thr Val Ala Ala Leu Leu Ser Gly Glu Met Lys Asp Thr Gly 
340 345 350 

Ile Val Ile Thr Leu Asn Ile Leu Glu Lys Phe Phe Ser Arg Ser Glu 
355 360 365 

Arg Ala Met Met Val Arg Thr Met Gly Gly Val Ser Ser Met Leu Pro 
370 375 380 

Lys Gly Gly Asp Val Ala Pro Asp Asp Leu Asn Gln Thr Asn Phe Ser 
385 390 395 400 

Asn Gly Ala Ile Arg Tyr Arg Glu Asp Ile Asp Lys Asp His Asp 
405 410 415 

Glu Phe Asp Ile Asp Asp Ala Leu Gln Phe Leu Lys Asn Val Thr Asp 
420 425 430 

Asp Asp Phe Lys Val Met Leu Ala Lys Asn Tyr Ser His Gly Leu Ala 
435 440 445 

Trp Thr Glu Lys Glu Val Leu Lys Asn Asn Glu Met Pro Ser Lys Trp 
450 455 460 

Ile Asn Pro Leu Glu Thr Ser Leu Pro Tyr Ala Pro Asp Met Lys Ile 
465 470 475 480 

Tyr Cys Val His Gly Val Gly Lys Pro Thr Glu Arg Gly Tyr Tyr Tyr 
485 490 495 

Thr Asn Asn Pro Glu Gly Gln Pro Val Ile Asp Ser Ser Val Asn Asp 
500 505 510 

Gly Thr Lys Val Glu Asn Gly Ile Val Met Asn Asp Gly Asp Gly Thr 
515 520 525 

Leu Pro Ile Leu Ala Leu Gly Leu Val Cys Asn Lys Val Trp Gln Thr 
530 535 540 

Lys Arg Phe Asn Pro Ala Asn Thr Ser Ile Thr Asn Tyr Glu Ile Lys 
545 550 555 560 

His Glu Pro Ala Ala Phe Asp Leu Arg Gly Gly Pro Arg Ser Ala Glu 
565 570 575 
His Val Asp Ile Leu Gly His Ser Glu Leu Asn Glu Ile Ile Leu Lys 
580 585 590 

Val Ser Ser Gly His Gly Asp Ser Val Pro Asn Arg Tyr Ile Ser Asp 
595 600 605 

Ile Gln Glu Ile Ile Asn Glu Ile Asn Leu Asp Lys Pro Arg Asn 
610 615 620 
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<210> 14 
<211> 432 
<212> PRT 

<213> Arabidopsis thaliana
<400> 14

Met Lys Lys Ile Ser Ser His Tyr Ser Val Val Ile Ala Ile Leu Val
1 5 10 15

Val Val Thr Met Thr Ser Met Cys Gln Ala Val Gly Ser Asn Val Tyr
20 25 30

Pro Leu Ile Leu Val Pro Gly Asn Gly Gly Asn Gln Leu Glu Val Arg
35 40 45

Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val Trp Cys Ser Ser Trp Leu
50 55 60

Tyr Pro Ile His Lys Lys Ser Gly Gly Trp Phe Arg Leu Trp Phe Asp
65 70 75 80

Ala Ala Val Leu Leu Ser Pro Phe Thr Arg Cys Phe Ser Asp Arg Met
85 90 95

Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp Tyr Gln Asn Ala Pro Gly
100 105 110

Val Gln Thr Arg Val Pro His Phe Gly Ser Thr Lys Ser Leu Leu Tyr
115 120 125

Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser Tyr Met Glu His Leu Val 
130 135 140 

Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val Asn Asp Gln Thr Ile Leu
145 150 155 160

Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu Ala Ala Ser Gly His Pro
165 170 175

Ser Arg Val Ala Ser Gln Phe Leu Gln Asp Leu Lys Gln Leu Val Glu
180 185 190

Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro Val Ile Leu Leu Ser His
195 200 205

Ser Leu Gly Gly Leu Phe Val Leu His Phe Leu Asn Arg Thr Thr Pro
210 215 220

Ser Trp Arg Arg Lys Tyr Ile Lys His Phe Val Ala Leu Ala Ala Pro
225 230 235 240

Trp Gly Gly Thr Ile Ser Gln Met Lys Thr Phe Ala Ser Gly Asn Thr
245 250 255

Leu Gly Val Pro Leu Val Asn Pro Leu Leu Val Arg Arg His Gln Arg
260 265 270

Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro Ser Thr Lys Val Phe His
275 280 285

Asp Arg Thr Lys Pro Leu Val Val Thr Pro Gln Val Asn Tyr Thr Ala
290 295 300

Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile Gly Phe Ser Gln Gly Val
305 310 315 320 

Val Pro Tyr Lys Thr Arg Val Leu Pro Leu Thr Glu Glu Leu Met Thr 
325 330 335 

Pro Gly Val Pro Val Thr Cys Ile Tyr Gly Arg Gly Val Asp Thr Pro
340 345 350

Glu Val Leu Met Tyr Gly Lys Gly Gly Phe Asp Lys Gln Pro Glu Ile
355 360 365

Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn Leu Ala Ser Leu Ala Ala
370 375 380

Leu Lys Val Asp Ser Leu Asn Thr Val Glu Ile Asp Gly Val Ser His
385 390 395 400

Thr Ser Ile Leu Lys Asp Glu Ile Ala Leu Lys Glu Ile Met Lys Gln
405 410 415

Ile Ser Ile Ile Asn Tyr Glu Leu Ala Asn Val Asn Ala Val Asn Glu
420 425 430

<210> 15
<211> 552
<212> PRT

<213> Arabidopsis thaliana
<400> 15

Met Gly Ala Asn Ser Lys Ser Val Thr Ala Ser Phe Thr Val Ile Ala
1 5 10 15

Val Phe Phe Leu Ile Cys Gly Gly Arg Thr Ala Val Glu Asp Glu Thr
20 25 30

Glu Phe His Gly Asp Tyr Ser Lys Leu Ser Gly Ile Ile Ile Pro Gly
35 40 45

Phe Ala Ser Thr Gln Leu Arg Ala Trp Ser Ile Leu Asp Cys Pro Tyr
50 55 60

Thr Pro Leu Asp Phe Asn Pro Leu Asp Leu Val Trp Leu Asp Thr Thr
65 70 75 80

Lys Leu Leu Ser Ala Val Asn Cys Trp Phe Lys Cys Met Val Leu Asp
85 90 95

Pro Tyr Asn Gln Thr Asp His Pro Glu Cys Lys Ser Arg Pro Asp Ser
100 105 110

Gly Leu Ser Ala Ile Thr Glu Leu Asp Pro Gly Tyr Ile Thr Gly Pro
115 120 125

Leu Ser Thr Val Trp Lys Glu Trp Leu Lys Trp Cys Val Glu Phe Gly
130 135 140

Ile Glu Ala Asn Ala Ile Val Ala Val Pro Tyr Asp Trp Arg Leu Ser
145 150 155 160

Pro Thr Lys Leu Glu Glu Arg Asp Leu Tyr Phe His Lys Leu Lys Leu
165 170 175

Thr Phe Glu Thr Ala Leu Lys Leu Arg Gly Gly Pro Ser Ile Val Phe
180 185 190

Ala His Ser Met Gly Asn Asn Val Phe Arg Tyr Phe Leu Glu Trp Leu
195 200 205

Arg Leu Glu Ile Ala Pro Lys His Tyr Leu Lys Trp Leu Asp Gln His
210 215 220

Ile His Ala Tyr Phe Ala Val Gly Ala Pro Leu Leu Gly Ser Val Glu
225 230 235 240

Ala Ile Lys Ser Thr Leu Ser Gly Val Thr Phe Gly Leu Pro Val Ser
245 250 255

Glu Gly Thr Ala Arg Leu Leu Ser Asn Ser Phe Ala Ser Ser Leu Trp
260 265 270

Leu Met Pro Phe Ser Lys Asn Cys Lys Gly Asp Asn Thr Phe Trp Thr
275 280 285

His Phe Ser Gly Gly Ala Ala Lys Lys Asp Lys Arg Val Tyr His Cys
290 295 300

Asp Glu Glu Glu Tyr Gln Ser Lys Tyr Ser Gly Trp Pro Thr Asn Ile
305 310 315 320

Ile Asn Ile Glu Ile Pro Ser Thr Ser Ala Arg Glu Leu Ala Asp Gly
325 330 335

Thr Leu Phe Lys Ala Ile Glu Asp Tyr Asp Pro Asp Ser Lys Arg Met
340 345 350

Leu His Gln Leu Lys Lys Tyr Val Pro Phe Phe Val Ile Arg Asn Ile
355 360 365

Ala His Arg Ser Ser Leu Ala Gly Phe Leu Leu Tyr His Asp Asp Pro
370 375 380

Val Phe Asn Pro Leu Thr Pro Trp Glu Arg Pro Pro Ile Lys Asn Val
385 390 395 400

Phe Cys Ile Tyr Gly Ala His Leu Lys Thr Glu Val Gly Tyr Tyr Phe
405 410 415

Ala Pro Ser Gly Lys Pro Tyr Pro Asp Asn Trp Ile Ile Thr Asp Ile
420 425 430

Ile Tyr Glu Thr Glu Gly Ser Leu Val Ser Arg Ser Gly Thr Val Val
435 440 445

Asp Gly Asn Ala Gly Pro Ile Thr Gly Asp Glu Thr Val Pro Tyr His
450 455 460

Ser Leu Ser Trp Cys Lys Asn Trp Leu Gly Pro Lys Val Asn Ile Thr
465 470 475 480

Met Ala Pro Gln Ile Leu Ile Gly Lys Ile Lys Gln Gln Pro Glu His
485 490 495

Asp Gly Ser Asp Val His Val Glu Leu Asn Val Asp His Glu His Gly
500 505 510

Ser Asp Ile Ile Ala Asn Met Thr Lys Ala Pro Arg Val Lys Tyr Ile
515 520 525

Thr Phe Tyr Glu Asp Ser Glu Ser Ile Pro Gly Lys Arg Thr Ala Val
530 535 540

Trp Glu Leu Asp Lys Ser Gly Tyr
545 550

<210> 16
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae
<400> 16

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605




BASF-NAE 33 77 / 99 PCT 31/58 12.09.2000 



Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660

<210> 17
<211> 387
<212> PRT

<213> Arabidopsis thaliana
<400> 17

Val Gly Ser Asn Val Tyr Pro Leu Ile Leu Val Pro Gly Asn Gly Gly
1 5 10 15

Asn Gln Leu Glu Val Arg Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val
20 25 30

Trp Cys Ser Ser Trp Leu Tyr Pro Ile His Lys Lys Ser Gly Gly Trp
35 40 45

Phe Arg Leu Trp Phe Asp Ala Ala Val Leu Leu Ser Pro Phe Thr Arg
50 55 60

Cys Phe Ser Asp Arg Met Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp
65 70 75 80

Tyr Gln Asn Ala Pro Gly Val Gln Thr Arg Val Pro His Phe Gly Ser
85 90 95

Thr Lys Ser Leu Leu Tyr Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser
100 105 110

Tyr Met Glu His Leu Val Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val
115 120 125

Asn Asp Gln Thr Ile Leu Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu
130 135 140

Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gln Phe Leu Gln Asp
145 150 155 160

Leu Lys Gln Leu Val Glu Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro
165 170 175

Val Ile Leu Leu Ser His Ser Leu Gly Gly Leu Phe Val Leu His Phe
180 185 190

Leu Asn Arg Thr Thr Pro Ser Trp Arg Arg Lys Tyr Ile Lys His Phe
195 200 205

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr Ile Ser Gln Met Lys Thr
210 215 220

Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Val Asn Pro Leu Leu
225 230 235 240

Val Arg Arg His Gln Arg Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro
245 250 255

Ser Thr Lys Val Phe His Asp Arg Thr Lys Pro Leu Val Val Thr Pro
260 265 270

Gln Val Asn Tyr Thr Ala Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile
275 280 285

Gly Phe Ser Gln Gly Val Val Pro Tyr Lys Thr Arg Val Leu Pro Leu
290 295 300

Thr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys Ile Tyr Gly
305 310 315 320

Arg Gly Val Asp Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe
325 330 335

Asp Lys Gln Pro Glu Ile Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn
340 345 350

Leu Ala Ser Leu Ala Ala Leu Lys Val Asp Ser Leu Asn Thr Val Glu
355 360 365

Ile Asp Gly Val Ser His Thr Ser Ile Leu Lys Asp Glu Ile Ala Leu
370 375 380

Lys Glu Ile
385

<210> 18
<211> 389
<212> PRT

<213> Arabidopsis thaliana



ttulvWys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe Ile Pro
1 5 10 15

Gly Ile Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gln Cys Ala
20 25 30

Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Leu Cys Trp
35 40 45

Val Glu His Met Ser Leu Asp Asn Glu Thr Gly Leu Asp Pro Ala Gly
50 55 60

Ile Arg Val Arg Ala Val Ser Gly Leu Val Ala Ala Asp Tyr Phe Ala
65 70 75 80

o-^o Gly Tyr Phe Val Trp Ala Val Leu Ile Ala Asn Leu Ala His Ile
85 90 95

Gly -.r Glu Glu Lys Asn Met Tyr Met Ala Ala Tyr Asp Trp Arg Leu
100 105 110

Ser =- re Gln Asp Thr Glu Arg Asp Gln Thr Leu Ser Arg Met Lys Ser
115 120 125

Asn Ile Glu Leu Met Val Ser Thr Asn Gly Gly Lys Lys Ala Val Ile
130 135 140

Val Pro His Ser Met Gly Val Leu Tyr Phe Leu His Phe Met Lys Trp
145 150 155 160

— CI" Ala Ala Pro Leu Gly Gly Gly Gly Gly Pro Asp Trp Cys
165 170 175

n e Lys Tyr Ile Lys Ala Val Met Asn Ile Gly Gly Pro Phe Leu Gly
180 185 190

Val Pro Lys Ala Val Ala Gly Leu Phe Ser Ala Glu Ala Lys Asp Met
195 200 205

Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu Pro Lys Gly
210 215 220

Gly Asp Thr Ile Trp Gly Gly Leu Asp Trp .e, - - ^ -t ^
225 230 235 240

Ala Pro Glu Met Glu Ile Tyr Ser Leu Tyr Gly Val Gly Ile Pro Thr
245 250 255

Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gln Ser Pro Asp Ser Cys Ile
260 265 270

Pro Phe Gln Ile Phe Thr Ser Ala His Glu Glu Asp Glu Asp Ser Cys
275 280 285

Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val Pro Val
290 295 300

Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys Thr Arg
305 310 315 320

Phe Asn Pro Ser Gly Ile Lys Thr Tyr Ile Arg Glu Tyr Asn His Ser
325 330 335

Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gln Ser Gly Ala His
340 345 350

Val Asp Ile Met Gly Asn Phe Ala Leu Ile Glu Asp Ile Met Arg Val
355 360 365

Ala Ala Gly Gly Asn Gly Ser Asp Ile Gly His Asp Gln Val His Ser
370 375 380

Gly Ile Phe Glu Trp
385

<210> 19
<211> 1986 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (1)..(1983)

<400> 19 

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 48
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag age aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac act cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tec aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc att cut ggt gca etc tta ggt gta ctt ttg ccg ttt age ttc 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc get tat cat gtc cat aat age gat age gac ttg ctt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tea ctt aaa gtg tat ttg gat gat egg aaa gac get 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

etc cca caa ggt aca agt teg ttt att gat gat att cag get ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

tac tec aca tct tct tta gat gat etc agt gaa aat ttt gec gtt cgt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa etc tta cat cat tat aat ace gag gec aaa cat cct ctt gta 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175



atg gtt cct ggt gec att tct acg gga ate gaa age tgg gga gtc ate 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gaa agt ttt tac atg ctg aga aca atg get atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta ata tta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

aca gga tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gee gcg tat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta aat eta gaa aga cgc gat agg tac ttt acg aag eta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa acc gaa ctg ttt cat caa teg agt ggt gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att aga cat tct atg gar tct cag att ate ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gaa gee gaa ggc cct ctt tac ggc aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac aca aat tea ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

act cca aag gca gtt cca get eta att agt ggt gaa atg aaa gat ace 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

att caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tea 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aaa att aag aaa aca aaa atg tea caa acg egg ggt ggc ata cca tea 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg eta cca aag aaa gaa gag gtc att tgg ggg gat acg aag tea tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430



tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg age gat get tec aac aaa aac ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

gac gec att aac atg aca tta teg ata tea cct gaa tag etc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac teg tte ggc tat tec aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca acg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa get ccc cac acg aaa ate tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

5515 gtg aac aac cca ace gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tec tct get ctg aat ccg acc ate gac tac gaa age aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc etc acc gag ggg gac gga acc get ccg etc gcg gcg cat tea 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

acg cgt cac aaa tgg gee cag ggc get tea ccg tac aac cct gec gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

ate aac gtc act att gtg gaa atg aaa cac cag cca gat cga ttc gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

aca cat ggt gga gca aaa age gec gaa cac gta gac ate etc ggc age 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac ate ctg aaa act gca age ggc aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

etc gec gag cca cgc caa ttg tct aat ctg age cag tgg etc tec cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

atg ccc tec cca acg caa 1986
Met Pro Phe Pro Met
660

<210> 20
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660






<210> 21 
<211> 1986 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (1)..(1983)

<400> 21 

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 48
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

gac gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag age aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac att cat cat caa cag gga tea ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aac gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt egg aga gai tec aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc ate ctt ggt gca ttc eta ggt gta ctt ttg ccg tct age ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc get tat cat gtt cat aat age gat age gac ctg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aac ttt gat tea ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

etc cca caa ggt ata agt teg ttt att gat gac att cag gec ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

tac tec aca tct tct tta gat gat etc agt gaa aat ttt gee gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa etc tea cgt gac tat aat ate gag gec aaa cac cct gtt gca 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175



atg etc cct ggt gtc act tct acg gga att gaa age tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga act ttt tac atg ctg aga aca atg gtt atg gat aaa get tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg eta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tac ttc ate 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt get gcg cat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat eta gat eta gaa aga cgc gat agg tac ttt acg aag eta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa ate gaa ctg ctt cat caa ttg agt ggc gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga car tec atg ggt tct cag att arc ttc tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

g^e gag ccc gaa ggc cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tea ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gec cca aag gca get cca get eta att agt ggc gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

act caa tea aac acg tta gec atg tat ggt ttg gaa aag ttc ttc cca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga act gag aga gta aaa acg tta caa acg egg ggc ggt ata cca cca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415



atg eta cca aag gaa gag gec att tgg ggg gat atg aag tea tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa age aat acg age gat get ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gec att aac atg aca tta teg ata tea cct gaa tgg etc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac teg ttc ggc tat tec aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa get ccc cac atg aaa ate tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

gtg aac aac cca act gaa agg gca tat gca tat aag gaa gag gat 1632
Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tec tct get erg asi ttg acc ate gac tac gaa age aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc etc acc gag ggg gac gga acc gtt ccg etc gtg gcg cat tea 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tQ'L cac aaa tgg gec cag ggc get tea ccg tac aac cct gec gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

act aac gtt act act gcg gad atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gca aaa age gec gaa cac gta gac ate etc ggc age 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac ate ccg aaa att gca age ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

etc gtc gag cca cgc caa ttg tct aat ttg age cag egg g-i_ ccc cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

atg ccc ttc cca atg taa 1986
Met Pro Phe Pro Met
660

<210> 22
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 22 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 235 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380



Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400



Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605



Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660



<210> 23
<211> 2312
<212> genomic DNA
<213> Schizosaccharomyces pombe

<400> 23

atggcgtctt ccaagaagag caaaacteat aagaaaaaga aagaagtcaa atetcctatc 60
gacttaccaa attcaaagaa accaactege gctttgagtg agcaaccttc agegtccgaa 120
acacaatccg tttcaaataa atcaagaaaa tctaaatttg gaaaaagatt gaattttata 180
ttgggcgcta tcttgggaat atgcggtgct ttttttttcg ctgttggaga egacaatget 240
gtLttcgacc ctgctacctt agacaaattt gggaatatgc taggctcttc agacttgttt 300
gatgacatta aaggatattc atcttataat gtgtttaagg atgcaccttt tactaeggae 360
aagccttcgc agtctcctag eggaaatgaa gttcaagttg gtcttgatat gtacaatgag 420
ggatatcgaa gtgaccaccc tgttattatg gttcctggtg ttatcagctc aggattagaa 480
agttggtcgc ttaataattg ctcgattcct tactttagga aacgtctttg gggtagctgg 540
tctatgctga aggcaatgtt ccttgacaag caatgetgge ttgaacattt aatgettcat 600
aaaaaaaccg gcttggatcc gaagggaatt aagctgegag cagctcaggg gtttgaagca 660
gccgattttt ttatcacggg ctattggatt tggagtaaag taattgaaaa ccttgctgca 720
attggttatg agcctaataa catgttaagt gcttcttacg attggcggtt ateatatgea 780
aacLtagagg aacgtgataa atatttttca aagttaaaaa tgtteattga gtacagcaac 840
actgtacata agaaaaacct agtgttgatt tctcactcca tgggttcaca ggttaegtae 900
tattttttta agtgggttgs agctgagggc tacggaaatg gtggaccgac ttgggttaat 960
gatcatattg aagcatttat aaatgtgagt ctcgatggtt gtttgactac gtttctaact 1020
tctgaacaga tatcgggatc 11tgattgga gcacccaaaa cagtggcagc get11tatcg 1080
ggtgaaatga aagatacagg cattgtaatt acattaaaca tgttaatatt taatttttgc 1140
taaccgtttt aagctcaact gaatcagttt teggtctatg ggtaagcaat aaattgttga 1200
gatttgttac taacttactg tttagtttgg aaaaattttt ttcccgttct gaggtatatt 1260
caaaaataca aatgtgctct acttt:ccta acttttaata cagagecatg atggttcgca 1320
ctatgggagg agttagttct atgettccta aaggaggega tgttgtatgg ggaaatgeca 1380
gttggccaag aaatatgtgc tgttaatttt ttattaatat ttaggctcca gatgatctta 1440
atcaaacaaa tttttccaat ggtgcaatta ttcgatatag agaagaca11 gataaggacc 1500
acgatgaatt tgacatagat gatgeattae aatttttaaa aaatgttaca gatgacgatt 1560
tcaaagtca" gctagcgaaa aattattccc aeggtectgc ttggactgaa aaagaagtgt 1620
taaaaaataa cgaaatgccg tctaaatgga taaacccgct agaagtaaga acattaaagt 1680
taczaaacta Cactaaccca aatagactag ccttccttat gctcctgata tgaaaattta 1740
ttgcgcncac ggggccggaa aaccaactga gagaggttat tattatacta ataatcctga 1800
ygggcaacct gtcactgaCt cctcgcttaa tgatggaaca aaagttgaaa atgtgagaga 1860
atctacgtct caaacattct attaactgtt ttattagggt attgttatgg atgatggtga 1920
tggaacCtta ccaatactag cccttggttt cgtgtgcaat aaagtttggc aaacaaaaag 1980
gtttaatcct gctaatacaa gtatcacaaa ttatgaaatc aagcatgaac ctgctgcgtt 2040
tgatctgaga ggaggacctc geteggcaga acaegtcgat atacttggac attcagagct 2100
aaatgtatgt tcattttacc ttacaaattt ctattactaa ctcttgaaat aaggaaatta 2160
1tttaaaagt ttcatcaggc catggtgact eggtaccaaa ccgttatata tcagatatcc 2220
agtacggaca taagttttgt agattgcaat taactaacta accgaacagg gaaataataa 2280
atgagataaa tctcaataaa cctagaaatt aa 2312
<210> 24
<211> 3685
<212> genomic DNA
<213> Arabidopsis thaliana

<400> 24

atgcccctta ttcatcggaa aasgccgacg 
ctggtgcacg atgaggattc ccaaBagaaa 
aaatcgaacg gaggagggaa gtggtcgtgc 
gtgtgtgtaa cctggtggtt tcttctcttc 
cagtatgtaa cggagcgaat cacgggtcct 
aaagaaggtc ttaaggcgaa acaicctgtn 
ctcgaccttt gggaaggcaa acaatgcgct 
ggaacttttg gtgaagtcta caaaaggtga 
gggatttgga ttggatcaga tgagatcacg 
tttaattcca tgtttgtctg tctcactctt 
attttcctaa gagactattt ctgtacgtgt 
tggactatcc ctgtttgatt gttaacitca 
gtcatcttat tgccaaatcc gttgctagac 
ttcctttact gtcgttgcgt gtagatttag 
^atgtcttgt tgggaataga caagctctaa 
gataaaggac tgtcgcttta ttatccaact 
tttgatcttt cagctcaatg tgc~tttctc 
cgagtttatt cacatgtctt gaatctcgtc 
ccccccgacc ctgctatgga catautaaaa 
acgatcrgca tcttcttctc tcggcccagt 
aatgttattg ccgacgotaa cagcgctata 
c:t:ga:tca ggcctctatg ttgggtggaa 
gatccagctg gcattagagt ccgagctgta 
cccggctact ttgtctgggc accgcugatt 
aaaaatatgr acacggctcc atacgactgg 
tactcatcgt tctttctact aLLCtgttcc 
cttaaatatg cctcatgctg aattaatagg 
gtaatataga gutgatggtt tctaccaacg 
ccatgggggt cttgtatttt ccacatttca 
gtggcggggg tgggccagat tggtgtgcaa 
gaccatttct tggLgttcca aaagctgttg 
ttgcagttgc caggtattga aaatcagctt 
gaactcaaag ttattcract aaataccaat 
ctgacattgg ttgattattt ttgctgctta 
atgatgaatt gataattctt acgcattgct 
taacatangc catactgccc tttggagggc 
ttccatgctt gcattcttta ccaattagcg 
tgcatctzatg gtct 1 t ta 1 t tcrcctcaat 
gtcacttggt tcc:caacat agaacctcac 
aggattccta gacaccgata tat-tagacc 
acgcacatgg gactcaacaa cgtctacgct 
gcttgattgg tcaccggaga aaggccacac 
aacttgtggu gaagcaggtg aaaacggagt 
gatgacatct tttgggaaag aagtagcaga 
ctatcgacca aggacatana aatcataaia 
azatctgtac attttacccg gugaagggag 
cctgtcgcga cgtgtggaca cagcaccatg 
ctgagaacaa ggtctacact gctggtgaac 
agatgatggc gcgcggugcc gcucaactct 
ccaagcatca agaccccaaa taccggccaa 
gatcccaact gcanccttcg ccctgacgca 
g/acacgcttc tcagctcaaa gcttacaaag 
c:cacLaata ttgaggtgct aaagttgata 



gagaaaccat cgacgccgcc atctgaagag 60 
ccacacgaat ccuccaaatc ccaccataag 120 
atcgattctt gttgttggtc cattgggtgt 180 
ctttacaacg caatgcctgc gagcttccct 240 
ttgcctgacc cgcccggtgc taagctcaaa 300 
gtcttcattc ctgggactct caccggtggg 360 
gatggtttat ttagaaaacg tttgtggggt 420 
gctcaacaat tctcactctt cctttatatt 480 
cacttgttgc ttcttcaaca tcactcaaac 540 
tacttctttt tttttttgat gtgaaacgct 600 
aaggtaagcg ttccaaggac gtaattggct 660
ggatataaaa tacctgcctt ggaatttcaa 720 
atgccctaga gtccgttcat aacaagttac 780
ctttgtgtag cgtacaacga agtagtgttt 840 
ctacatctgt ggaaagtgtg ttcaggctgt 900 
acgtatatgt g^aattaaag ctagttccct 960 
aacttttttc tcaacttcaa agttccacat 1020 
catcctcgct ctgttatcca gctttgaact 1080 
aaaaagtgtt tcgtgggttg catctttgtt 1140 
gtccatgttt ttgccatggt agagatgggc 1200 
gttgatagta tcccaacnaa tcaattatct 1260 
cacatgtcac ttgacaauga aactgggctg 1320 
tcaggactcg tggccgctga ctactttgct 1380 
gctaacctcg cacataatgg acatgaagag 1440 
cggctttcgt ttcacaacac agaggntctt 1500 
atgttacgtt tctctcttca ttacttaagg 1560 
tacgtgauca gacccctagc cgtatgaaaa 1620
gtggaaaaaa agcagatata gttccgcatt 1680 
tgaagtgggt tgaggcacca gctcctctgg 1740 
agtatatcaa ggcggtgatg aacacnggtg 1800 
cagggctttt cnctgctgaa gcaaaggatg 1860
atacttttga tgatcagaac cttggctctg 1920 
cccaacaaca ttgccatatt atcgctgcaa 1980 
tgtaactgaa actctctcga gattagacaa 2040 
ctgcgatgac cagtttctca gcctcgacga 2100 
attgaatttt gctatggaaa gcgctggagc 2160 
ttattctgci: tctttcaatt ttcttgcata 2220 
taaagactcg ttggattag'a tgctctacta 2280 
tttcttcgaa aattgcagag cgattgcccc 2340 
tcagacctcg cagcatgtaa tgagaatgac 2400 
accgaaggga ggtgacacga tatggggcgg 2460 
ctgctgcggg aaaaagcaaa agaacaacga 2520 
ttccaagaaa agccctgcta actatggaag 2580 
ggctgcgcca tctgagatta azaatactga 2640 
aaccctgtac aci:cgcgac tgtatgatga 2700
ctgtcaaagg ccagagtatc ccaaatcaca 2760 
acatgggaat tgctgcgatc aaagctatcg 2820
ctatagatct actacattat getgetcata 2880
ctcatggaa: tgctgaagat ttgeacgaca 2940
ateegtcaga gacaaagtaa gtgattccct 3000
ttatcagtct ttttgtcccc ggtcttgccg 3060
ccgcttczgc gcctttctca aaaacgc:cg 3120
catgtgactc ttccttataa atcctccgtt 3180




BASF-NAE 33 77 / 99 PCT ' 50/58 1 2.09.2000 



tggtttgttc tgctttttca gattaccgaa tgctcctgag acggaaatct actcattata 3240
cggagtgggg ataccaacgg aacgagcata cgtatacaag cttaaccagt ctcccgacag 3300
ttgcaccccc tttcagatat tcacctctgc tcacgaggag gacgaagata gctgtctgaa 3360
agcaggagtt tacaatgtgg atggggatga aacagtaccc gccctaagcg ccgggtacat 3420
gtgtgcaaaa gcgtggcgtg gcaagacaag attcaaccct tccggaatca agacttatat 3480
aagagaatac aatcaccctc cgccggctaa cctgttggaa gggcgcggga cgcagagtgg 3540
tgcccatgtt gatatcatgg gaaactttgc t~tgatcgaa gacatcatga gggttgccgc 3600
cggaggtaac gggtctgata taggacatga ccaggtccac tctggcatat tcgaatggtc 3660
ggagcgtatc gacctgaagc tgtga 3685
<210> 25
<211> 402
<212> cDNA
<213> Arabidopsis thaliana

<220>
<221> CDS
<222> (120)..(401)

<400> 25

agaaacagct ctttgtctct ctcgactgat ctaacaatcc ctaatctgtg ttctaaattc 60 

ctggacgaga tttgacaaag tccgtatagc ttaacctggt ttaatttcaa gtgacagat 119 

atg ccc ctt att cat cgg aaa aag ccg acg gag aaa cca tcg acg ccg 167
Met Pro Leu Ile His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro
1 5 10 15

cca tct gaa gag gtg gtg cac gat gag gat tcg caa aag aaa cca cac 215
Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gln Lys Lys Pro His
20 25 30

gaa tct tcc aaa tcc cac cat aag naa tcg aac gga gga ggg aag tgg 263
Glu Ser Ser Lys Ser His His Lys Xaa Ser Asn Gly Gly Gly Lys Trp
35 40 45

tcg tgc atc gat tct tgt tgt tgg ttc att ggg tgt gtg tgt gta acc 311
Ser Cys Ile Asp Ser Cys Cys Trp Phe Ile Gly Cys Val Cys Val Thr
50 55 60

tgg tgg ttt ctt ctc ttc ctt tac aac gca atg cct gcg agc ttc cct 359
Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro
65 70 75 80

cag tat gta acg gag ccg aat cac gng tcc ttt gcc tta ccc g 402
Gln Tyr Val Thr Glu Pro Asn His Xaa Ser Phe Ala Leu Pro
85 90



<210> 26
<211> 643
<212> cDNA
<213> Zea mays

<220>
<221> CDS
<222> (1)..(402)

<400> 26

egg gag aaa ata get get ctg aag ggg ggt gtt tac tta gec gat ggt 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

gat gaa act get cca gtt ctt agt gcg ggc tac atg tgt gcg aaa gga 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

tgg cgt ggc aaa act cgt ttc age coc gec ggc age aag act tac geg 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

aga gaa tac age cat teg cca ccc tct act etc ctg gaa ggc agg ggc 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

acc cag age ggt gca cat gtt ga
atg ggg aac tct gee cc^
ict 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

gag gac gtc ate aga ata get get ggg gca acc ggt gag gaa att ggt 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

ggc gae cag gtt tat tea gat ata ttc aag tgg tea gag aaa ate aaa 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

ttg aaa ttg taa cct atg gga age taa aga agt gee gac ccg ttc ace 384
Leu Lys Leu
115

gcg ttc caa age gec ctg ectgagtgea actctggatt ttgcttaaat 432

attgtaatet ttcacgctec attcgtccct ttgtcaaatt tacattegac aggacgecaa 492

tgegataega tgttgtaccg ctattttcag cattgtatat taaactgtac aggegcaagt 552

tgcattcgcc acctgaaatt gtgtagccgt tttctttacg atttaatanc aagtggcgga 612
gcagtgcccc aagcnaaaaa aaaaaaaaas a 643
<210> 27
<211> 115
<212> PRT
<213> Zea mays

<400> 27

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115
<210> 28
<211> 516
<212> cDNA
<213> Neurospora crassa

<400> 28

ggtggcgaag acganggcgg aagttggagg ctaacgagaa tgacncccgg agatggauct 60
accctctaga cacacgacia ccnttgcacc cagcctcaag gtntacngtt tntatgggta 120
ggaagccgac ggagcgagcc tacatctatc tggcgcccga tcccgggacg acaacgcatc 180
tttagatgac gatcgatacg actttgactn aggggcacat tgaccacggt gtgattttgg 240
gcgaaggcga tggcacagtg aaccttatga gtttggggta cctgtgcaat aaggggtgga 300
aaatgaagag acacaatcct gcgggctcaa aaataaccgt cgtcgagatg ccgcatgaac 360
cagaacggtt caatccgaga ggagggccga atacggcgga cttaaatatg tagaaaaggt 420
tgaaatttat gaagactaat taaatacggc acataggtta ctcaatagta tgactaacta 480
aaaaaaaatt ttttttctaa aaaaaaaaaa aaaaaa 516
<210> 29
<211> 1562
<212> genomic DNA
<213> Arabidopsis thaliana

<400> 29

atgaaaaaaa tatcttcaca ttattcggta 
acctcgatgt gtcaagctgt gcgtagcaac 
ggaggtaacc agctagaggt acggctggac 
agcagctggt tatatccgat tcataagaag 
gcagcagtgt tattgtctcc cttcaccagg 
gaccctgatt tggatgatta ccaaaacgct 
ggtccgacca aatcacttct atacctcgac 
catcatctLg ggacatttgc ataatgaaca 
atatcaacat ccatttatat gctagtcggt 
tgagtgttat gtgattttcc attttaaatg 
ctatgtcatg agaatcataa ggacaccatg 
gcagagatgc cacacctcac atggaacatt 
atgttaacga ccaaaccatc ctaggagctc 
cgggccaccc gtcccgtgta gccccacagt 
aaactagcag cgagaacgaa ggaaagccag 
:tttcgccct ccatttcctc aaccgcacca 
actttgttgc actcgctgcg ccatggggcg 
ctggcaacac actcggtgtc cctttagcta 
cctccgagag taaccaacgg ccacttccat 
cgctcgccgc aactccccsg gitiaaccaca 
acactggatt cccacaagga gctgtgccat 
agccgatgac tccgggagtg ccagtcactc 
aggtuctgat gtatgcaaaa ggaggattcg 
gagatgggac ggttaattcg gcgagcttag 
cagagactga tggagtttcg catacatcta 
tcacgaagca gacttcaatc attaactatg 
ga 



gtcatagcga 


tactcgttgt 


ggtgacgatg 


60 


gtgtaccct t 


tgattctggt 


tccaggaaac 


120 


agagaataca 


agecaagtag 


tgtccggcgt 


180 


agtggtggat 


ggt t taggct 


atggtccgat 


240 


tgct tcagcg 


atcgaatgat 


cttgtactat 


300 


cctggtgtcc 




tec tcatttc 


360 


cctcgtctcc 


ggttagtacc 


t tccaagata 


420 


aaat agacat 


aaatttgggg 


gat tattgt t 


480 


aatgtgagtg 


ttatgttagt 


atagttaatg 


540 


a age t agaaa 




aat aa t g 1 1 g 


600 


taaatgtagc 


ttaataataa 


gguttgatt t 


660 


tggtgaaagc 


tctagagaaa 


aaatgcgggt 


720 


catatgattt 


caggtaegge 


ctggctgct t 


780


tcctacaaga 


cctcaaacaa 


ttggtggaaa 


840 


tgatactccc 


ctcccatagc 


ctaggaggac 


900 


cccccccatg 


gcgccgcaag 


t ac a t c aaa c 


960 


ggacgatctc 


tcagatgaag 


acat ttget t 


1020 


accctttgct 


ggtcagaccg 


cat cagagga 


1080 


ctaccaaagt 


gtttcacgac 


agaactaaac 


1140 


caccttacga 


garggaregg 


ttt tttgcag 


1200 


acaagacaag 


agtgttgcct 


ttaacagagg 


1260 


gcatat atgg 


gagaggagtr 


gatacacegg 


1320 


ataagcaacc 


agagaciaag 


tatggaga^g 


1380 


cagctttgaa 


agtcgatagc 


t cgaacaccg 


1440


tact taaaga 


egagatcgea 


cttaaagaga 


1500 


aattacccaa 


tgttaatgcc 


gtcaatgaat 


1560 








1562 
<210> 30
<211> 3896
<212> genomic DNA
<213> Arabidopsis thaliana

<400> 30

atgggagcga attcgaaatc agtaacggct 
atttgcggtg gccgaactgc ggcggaggat 
ctatcgggca taarcattcc gggatttgcg 
gactgtccat acactccgct ggacitcaat 
aaggnccgtg atcttcactt ccttcgctcc 
aatcccaagc gaaacatagc aatgaagcat 
gtcaacagtc acgcttctga atctgagttc 
tgttucccat cgctt z tggt tcgctaaatg 
tttttattca accagacctg caagttcttc 
ggtcatttta cttgtgcatt gtgattcttt 
tggtttacag cttctttctg cegccaactg 
taatcaaaca gaccaccccg agtgtaagtc 
agaattggat ccaggttaca taacaggtag 
tcaaettgat atcatcctgt tgtgatataa 
tcaggLcctc ttcccactgc c^ggaaagag 
gaagcaaatg caattgtcgc tgtcccatac 
gagcgcgacc trtacuttca caagctcaag 
ttctccctc: atgtaagaca agctaagagc 
tttgaaactg cttaaaaact ccgtggcggc 
aaeaacorct tcagatactc tctggaatgg 
ttgaagtcgc ctgatcagca tatccatgct 
ttaagttacc attttatttt ttctccaatt 
agctcga:ac ctgatttgtt gutgatctag 
tcaaatctac tctctctggt gtaacgtttg 
tctccttagt :':taagtagt tgatancaac 
tgsaactatt actttrgtta atugaactgc 
agtgctagtt atcaaagaac atacugcggg 
caaccaaacc acatgcacac tgatutagtc 
gaagaaactt tgactgaaa: ccccttatLt 
tgtgacatat tgacatgcgc ttctcatgtc 
ggtugttgtc caat^ctttt gcgtcgtcat 
agggtgacaa cacatuctgg acgcatcctt 
tacaccaccu cgaagaagag gaauccaac 
ttaacatcga aactccttcc actagcggtt 
aaaagtttca ccaagaatgt tcactcccat 
tacagaaaca gccctagcca acargaccag 
tttcacagcc cgcgaactag cagatgggac 
agacaccaag aggatgttac accagctaaa 
tatugctcat cgatcaccac ctgctggcct 
ccaLa:caa: igctcatatg ctctgccttt 
ccctatcatt ga::atcagt tccctcctta 
atgaatccaa aagggggtat :ttagtcgat 
caatagcg:c aatcttgttt ccctaccaaa 
atac:;ttca ggttgtatca cgatcaccct 
ccacccataa aaaacgta:: atgcatatac 
attctcaata rcacaccaic cgatgacttc 
tcttcctgaa c:acgatc:a zzzcczccct 
ccaaa-igta: gaagctctct gtcacagcc: 
ta^cctgata a c toga c car cacggatatc 
aggcaactzi: ccgcaacggc agaag^aaaa 
GtcgcatCLt accrcagecg cacaagcaaa 
tcaccaUcc atttgagcut agtggatgat 
tgaaacacga caczzgzacc aaagacaact 



tccttcaccg tcatcgccgt ttttttcttg 60 
gagaccgagt ttcacggcga ctactcgaag 120 
tcgacgcagc tacgagcgtg gtcgatcctt 180 
ccgcticgacc tcgtatggct agacaccacc 240 
ttatcccgtc ggtcgagtca cttgttgatg 300 
gtcccgLCtc tcetattgat tcgttcacta 360 
agagtcatau aaaacagctg acccggcgag 420 
tagcgcaatg aatgtgtaat tagtctgcgc 480 
agagugctca atagtagtca gaaaatgtta 540 
tggttgttgc ttactgateg acgtgaugga 600
ctggcttaag tgtatggtgc tagatcctta 660 
acggcccgac agtggtcxtt cagccaccac 720 
tttcggattt ttctttcttc tgagttTatcc 780 
tatggctaag ttcattaact tggtcaautt 840 
tggcttaagr ggtgcgttga gtttggtata 900 
gattggagac tgtcaccaac caaat:ggaa 960 
ttagtcctta tcaggctaat gtcttttatc 1020 
cctggtcgtc ttcctttttg caggttgacc 1080 
ccctctatag tattLgccca tecaatgegt 1140 
ctgaggctag aaactgcacc aaaaca^tat 1200 
tatttegctg rtggtaccgg ccuactatcc 1260
gggggagtta tgctgtgact cactggatt.g 1320 
gagctcctct tcttggctct gttgaggcaa 1380 
gccttcctgu ttctgaggtg acctccgact 1440 
caggtcctat aactcactgg atuctccttt 1500 
~gtacgcgat atggtatctg tagatcttga 1560 
cagtacacct gtcagcggcc ttagctaata 1620 
czcagattat taeggcagae tttaagctga 1680
taataggcta tgatttgett atcgaaatca 1740 
tttegtagge aaggcttcag ggaactgetc 1800 
cgtggcttat gccatLctca aagaattgea 1860
ctggggctgc tgcaaagaaa gataagegeg 1920
caaaatattc tggctggccg acaa.acatt;a 1980
agactccgca tacgeaactg caacaccaac 2040 
atctcgttcc tttgacgtgt atccaccagc 2100 
catggaatgt ggccttccca ccctcrtctc 2160 
tcctttcaaa gcaatagaag actatgaccc 2220 
gaagtacgta cctctctctg tgataagaaa 2280 
cttgtacgtc aaattgtttt gtttaaatcu 2340 
cctactataa gaaacaagta taatcagaaa 2400 
cantszggaa tgtctttttc gtttacagtt 2460 
tgattctctc attctceagc ttgtccagac 2520 
tc^ttgugaa ttataaataa catgetaact 2580 
gttttcaacc ctctgactcc tegggagaga 2640 
ggtgctcatc taaagacaga ggcatgatgc 2700 
cccattatat cccccz-zztg gtttgeaata 2760 
tgcacctzat gctatcaagc gttaaaggta 2820 
ggtcactacc ttgccccaag tggcaaaccc 2880
atttatgaaa ccgaaggttc cctcgtctca 2940 
caggaacgea aagtctcctg caicagccta 3000
ttatcaaaca acnaaaattt aagtactttc 3060 
cageggctta aagcgggaag agetgetgea 3120
agcaaaacaa aactaaccca cctctgaacc 3180






tcatattatt aggagtagtc gtgcttttaa aaaatttgtt ttaagaaacc gaaaaactag 3240
tucatatctt gattgtgcaa tatctgeagg tctggaactg tggttgatgg gaacgctgga 3300
cctataactg gggatgagac ggtaagctca gaagttggtt ttgaaattat ettcttgeaa 3360
actactgaag actaagataa tacttgette tggaacactg ettgetatgt tctctagtac 3420
actgcaatat tgactctccg cnacttttat tgattatgaa attgatctct tataggtacc 3480
ctatcactca ctctcttggt gcaagaattg gctcggacct aaagttaaca taacaatggc 3540
tccccaggta ctcttLtcta gttcctcacc ttatatagat caaactttaa gtgtactttt 3600
ctggttacgt gttgatttac ctccaacttg ttctttctaa aaatcatata tctctgtact 3660
cctcaagaac ttgtattaat ctaaacgaga ttctcattgg gaaaauaaaa caacagccag 3720
aacacgatgg aagcgacgta catgtggaac taaatgttga tcacgagcat gggtcagaca 3780
tcatagctaa catgacaaaa gcaccaaggg ttaagtacat aaccttttac gaagactctg 3840
agagcattec ggggaagaga accgcagtct gggagcttga taaaagtggg tattaa 3896



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<210> 31 
<211> 709 
<212> cDNA 
<213> tomato 

<400> 31

ctggggccaa aagtgaacat aacaaggaca ccacagtcag agcatgatgt ncagatgtac 60 

aagtgcatct aaatatagag catcaacatg gtgaagatat cattcccaat atgacaaagt 120 

tacctacaat gaagtacata acctattatg aggattctga aagttttcca gggacaagaa 180 

cagcagtttg gaagcttgat aaagcaaatc acaggaacat tgtcagatct ccagctttga 240 

tgcgggagct atagcctgag atgtggcatg atattcatcc tgataaaaag tccaagtttg 300 

ttacaaaagg tggtgtctga tcctcactat tttcttctat aaacgtttga gtttgtattg 360 

acattgtaag tattgcaaca aaaagcaaag cgtgggcctc tgagggatga ggactgctat 420 

tgggattacg agaaagctcg atgtgcatgg gctgaacatt gtgaatacag gttagaatat 480 

tcaaattata ttttgcaaaa tattctcttt ttgtgtatct aggccacctt tccccggtca 540 

caacgatgca gatatgtatt cggggatgtt cacctgggac agagttgcag attgacigagt 600 

tctacacctc acatcctgtc acactatgtg tgatatttaa gaaactttgt ttggcggaac 660 

aacaagtttg cacaaacatt tgaagaacaa agcgaaatga cccagagag 709 



